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MULTIPLE FACTOR ANALYSIS OF ACADEMIC 
ACHIEVEMENT: A COMPARATIVE STUDY OF 
FULL-BLOODED INDIAN AND 
WHITE CHILDREN’ 


OMER JOHN RUPIPER** 
University of Kansas 


I. Introduction 





THIS STUDY had its origin in the studies con- 
ducted by the Bureau of Indian Affairs and the Un- 
iversity of Kansas during the period 1951-55 (3, 
13). This Indian Education Research Project is 
undoubtedly the most comprehensive study of In- 
dian and white children whichhas been made. The 
program explored the educational status of Indian 
children and emphasized two functions: predictive 
testing and achievement testing. The purpose of 
the predictive testing was an attempt to investi- 
gate high-school graduates’ academic skills and 
to predict their probable success if they consider 
further formal education. The achievement test- 
ing was designed toservetwo purposes: (a) a con- 
tinuing evaluation of the status of educational 
achievement of Indian children for administrative 
use, that is, the recording and interpreting of test 
scores for describinga pupil’s educational 
achievement as measured by tests, and (b) the use 
of data obtained as a functional part of the instruc- 
tional program. These studies showed that Indian 
children differedfrom white children as measured 
by achievement test results and that the smaller 
the proportion of full-blooded and non-English 
speaking children in a specific group of Indian 
children the higher the group achieved. It was 
recognized that these differences in educational 
achievement depended ondifferences in experien- 
tial, environmental, and cultural factors and not 
necessarily on innate ability alone. 

The present study was an attempt to isolate and 
compare the factor structures and sets of factor 
patterns obtained from the achievement test scores 
for full-blooded Indian children and white children. 
The purpose of this study was to investigate the 
degree of relationship between the subtests of the 
measuring instrument, to determine the number 
of independent factors which account for the scores 
on the subtests, how heavily each of the subtests 
is weighted with the factors, and which factors 
appear as a whole for the different groups of sub- 
jects. A comparison of factors by sex, grade 





* All footnotes will be found at end of article. 





level, and race was made. 

Although there has been agreatdeal of research 
in the literature which concerns intelligence and 
educational achievement of American Indians, no 
studies were evident in which factor analysis had 
been applied to achievement test scores for the 
purpose of comparing full-blooded Indian and 
white children onlarge samples. Perhaps the only 
attempt was made by Harris (33) in which corre- 
lations were obtained between 15 language skill 
variables in four groups of fifth-grade Indian stu- 
dents. The intercorrelations were too irregular 
for any definite factors to emerge. Experimental 
studies on intelligence with Indians (20, 26, 27, 28, 
29, 30, 34, 36, 39, 46) indicate that differences in 
test performance between Indian and white chil- 
dren are not necessarily due to differences in in- 
nate capacity. The evidence indicates that white 
children and mixed-blooded Indian children did 
better where language was vital. The results 
showed that Indians scored below age and that fe- 
males performed better than males on the intelli- 
gence tests used, but these results seem meager 
and inconclusive. 

Anthropological studies (37, 39, 40, 41) indicat- 
ed that there were no significant differences in 
results of performance intelligence tests for Indi- 
an children as comparedto white children. Intel- 
igence tests of a verbal nature, requiring writing, 
resulted inlower IQ’s for Indian children. The 
educational achievement of Indians in public 
schools, federal schools, and mission schools as 
measured by test scores resulted in a hierarchi- 
cal order, respectively (2,13,42,43,52). Indian 
children with highest degree of Indian blood and 
non-English speaking ability at the time of enter- 
ing school obtained the lowest test results. It was 
believed that cultural factors played an important 
role in the learning process of these children. 

Factor analytic studies on intelligence and age 
with white children (1, 4, 5, 7, 9, 14,17, 19, 23, 24, 
25, 38, 44,45, 47,48) emphasized that a general 
factor was established early in life. However, 
conflicting evidence existed as to the first factor 
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changing with increasing age. Where factor pat- 
terns did change, there was no consistent trend in 
evidence. Factor studies regarding levels and sex 
for white children (35, 49, 50, 51) revealed a gen- 
eral agreement as to no differentiation in factor 
patterns between boys andgirls. There was indi- 
cation of conflicting evidence regarding the theory 
of the general factor playing a less important role 
as special abilities develop. The factor patterns 
change relatively little from grade level to grade 
level, and where changes did occur the patterns 
did not exhibit any systematic trend. Mental fac- 
tors seemed to depend on background, education, 
and experience of the subjects involved. 

Where educational skills for white children 
were factor analyzed (6,8, 10, 11, 12, 15, 16, 18, 32, 
53) it was apparent that educational skills were 
rather complex with regard to the identification of 
component factors, that boys and girls vary ac- 
cording to different academic skills, and the more 
simple and relevant the test the more clearly it 
produced the factor. Coleman (11) pointed out that 
a pupil’s basic learning difficulty can be diagnosed 
at an earlier age than was formerly believed and 
that school achievement depended upon motivation 
and other factors. 

In consideration of the review of literature, 
five important conclusions were drawn from the 
results: 1) Factors other than innate capacity 
were operating on test performance of both Indian 
and white children; 2) because of contradictory 
findings, no conclusive statements can be made as 
to sex and racial superiority on test performance; 
3) factor patterns change relatively little from 
grade to grade withno significant changes between 
sex; 4) a common‘factor is established relatively 
early in life with doubt as to its role becoming 
minimized as specific abilities devel op with in- 
creasing age, and 5) because of the complexity of 
educational skills, it was difficult to identify their 
component factors. 


Hypotheses 


On the basis of these studies, one hypothesis 
concerning differences in achievement raw score 
means by grade level as to race and sex was de- 
veloped. Five hypotheses were formulated per- 
taining to the number, size, and pattern of factors 
of achievement scores as measured by the Califor- 
nia Achievement Test Batteries. The hypotheses 
are: 


1. No differerces exist in achievement raw 
score means on the subtests for the elementary, 
intermediate, and advanced grade levels for each 
race by sex. 

2. The same number of factors will appear at 
each grade level for the race groups by sex. 

3. The factor loadings will be of the same mag- 
nitude for all groups. 





4. Higher factor loadings of a verbal nature 
will appear for white children than for full-blooded 
Indian children. 

5. Higher factor loadings of a numerical nature 
will appear for white children than for full-blooded 
Indian children. 

6. A common factor will appear throughout all 
the subtests for all groups. 


II. Procedure 


The Testing Program 





A fall achievement testing program was con- 
ducted by the United States Bureau of Indian Af- 
fairs and the University of Kansas during the per- 
iod 1951-55. The California Achievement Test 
Battery was used throughout the entire testing pro- 
gram for grades four through twelve. The chil- 
dren attending federal schools, public schools, and 
mission schools were tested in the Albuquerque 
and Phoenix areas in 1951, Aberdeen areain 1952, 
Billings area in 1953, and the Anadarko and Mus- 
kogee areas in 1954. Thetests were administered 
by Indian Bureau and public and mission school 
personne! in their respective schools. The public 
and mission schools included were those which op- 
erated in the vicinity of the federal schools includ- 
ed in this study and they enrolled a considerable 
proportion of Indian pupils. All answer sheets 
were sent to the University of Kansas for inspec- 
tion, necessary processing, andscoring. The raw 
scores of each subtest for each individual were 
recorded and tabulated. These were punched on 
IBM cards with background information for each 
child. 


The Population 





The geographical locations of the administra- 
tive areas of the United States Bureau of Indian 
Affairs in which testing was conducted were: 
1) Phoenix area— Arizona, exclusive of the Navajo 
and Hopi, 2) Albuquerque area—New Mexico and 
southern Colorado, exclusive of the Navajo, 3) 
Aberdeen area—South Dakota, North Dakota, and 
northeastern Nebraska, 4) Billings area—Montana 
and Wyoming, 5) Muskogee area —eastern Okla- 
homa and Mississippi, and 6) Anadarko area— 
western Oklahoma and Kansas. Between the years 
1951 and 1954, inclusive, 13,636 Indian children 
and 9922 white children were tested in federal, 
public, and mission schools. These 23, 608 chil- 
dren lived in rural areas either on farms, ranches, 
reservations, or small communities. 

The aggregate of full-blooded Indian children 
attending federal schools and white children attend- 
ing public schools combined in all areas were 
chosen for this study. The total number of sub- 
jects, 14,888, were separated as to full-blooded 
Indian children and white children by sex and 





RUPIPER 


TABLE I 


DISTRIBUTION OF SUBJECTS BY RACE, SEX, AND 
GRADE LEVELS 





Full-Blood Indian Children 





Males Females 








Elementary Level 
Grades h, 55 6 


Intermediate Level 
Grades 7, 8, 9 


Advanced Level 
Grades 10, 11, 12 























Elementary Level 
Grades 4, 5, 6 


Intermediate Level 
Grades 7, 8, 9 


Advanced Level 
Grades 10, ll, 12 

















Total = 1,888 
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grade levels. To conform to the suggested grade 
placement for the achievement tests, the grade 
levels were combined into elementary (grades 4, 
5, and 6), intermediate (grades 7,8, and 9), and 
advanced (grades 10,11, and 12). Table I shows 
the breakdown of the subjects and the number in 
each group with percentages. The approximate 
age ranges for the federal Indian children were 
from eight to 17 years for the elementary level, 
10 to 19 years for the intermediate level, and 15 
to 24 years for the advanced level. For public 
school white children the approximate age ranges 
were from eight to sixteen years for the elemen- 
tary level, 11 to19 years for the intermediate 
level, and 15 to 20 years for the advanced level. 


The Test Battery 


The California Achievement Test Battery, 
Form AA, was utilized to measure the subjects’ 
achievement in fundamental reading, arithmetic, 
and language skills through the entire testing pro- 
gram. These tests were chosen because they were 
already being used wideiy in Indian Bureau schools, 
they were selected by Indian Bureau personnel as 
being as close to content of the curriculum as any 
of the other achievement batteries, and they were 
machine scored. Thetests of basic skills used 
were: a) reading vocabulary, b) reading compre- 
hension, c) arithmetic reasoning, d) arithmetic 
fundamentals, e) mechanics of English and gram- 
mar, and f) spelling. Since the sections of each 
test contained such a small number of items, only 
total scores for each test were obtained rather 
than scores for each sectionofthe test. The scor- 
ing formula was the number of right responses for 
each of the sixtests. Since the California Achieve- 
ment Test Battery is widely known and is used as 
a test of academic achievement, a discussion of 
its construction, reliability, and validity is unnec- 


essary. 


The Factorial Analyses 





The raw scores for each individual were punched 
on IBM cards. Tabulating machine operations 
were used in obtaining frequency distributions, 
sums of squares, and sums of cross products for 
each test by race, sex, and grade level groups. | 
The means, medians, standard deviations, ranges, 
and indices of skewness were computed from the 
frequency distributions. There were only slight 
discrepancies with no differences between the me- 
dian and mean greater than three rawscore points. 
In many cases the differences were less than one. 
Many of the distributions were slightly skewed 
which indicated that the tests could have been 
somewhat too difficult for the students tes ted. 
However, the degree of skewness was not suffi- 
ciently large where the assumption of normality 
could not be met. A rather consistent trend was 





noted in the variability among races, sexes, and 
grade level groups. In general, the standard de- 
viation is considerably larger on the Reading, Vo- 
cabulary, and Arithmetic fundamentals test than 
on the other four tests in the battery for the var- 
ious groups. Pearson product-moment correla- 
tion coefficients were determined among the six 
tests by race, sex, andgrade level. Twelve cor- 
relation matrices for the various groups were 
formulated, i.e., a separate correlation matrix 
was constructed for males andfemales at the ele- 
mentary, intermediate, and advanced grade level 
groups for both full-blooded Indian children and 
white children. Thecorrelations were all positive 
with the lowest correlation of r = .35 between the 
Reading vocabulary test and the Arithmetic fund- 
amentals test at the advanced level for full-blood- 
ed Indian children. For the sample size N = 551 
this correlation differs from zero at the .01 level 
of confidence, as do all other coefficients of cor- 
relation in this study. 

The correlation matrices were factored by the 
centroid method of analysis (22, 31,54). The first 
extraction of factors was accomplished by using 
the highest correlation in each matrix as the esti- 
mated communality. After each factoring, ad- 
justments were made inthe diagonal cells and the 
process was continued until three factors were 
extracted. In order to obtain a better estimate of 
the communalities, the calculated communalities 
were used in the diagonal cells and the factoring 
was repeated. The new centroid factor loadings 
and the revised estimate of the communalities are 
listed in Tables IV through VII. After the second 
extraction of factors the residuals of the third fac- 
tor were sufficiently small to cease factoring. 

An oblique simple structure was obtained for 
the three factors. The method of extended vec- 
tors was utilized because, according to Thur- 
stone (54), where three dimensions are used the 
method gives the solution in one diagram if a 
simple-structure solution exists. This method 
seemed most appropriate since a central refer- 
ence vector existed on which all the test vectors 
have positive projections. The Extended Factor 
Matrices Eg are shown in Table VIII through XIX. 
Upon oblique rotation of the centroid factor load- 
ings for full-blooded Indian male children at all 
grade level groups, the third extracted factor 
rotated with the second did not reveal clear con- 
figurations, therefore, meaningful traces could 
not be drawn which would aid in the establishment 
and interpretation of these factors. The second 
factor residuals, range -.05to.04, for full-blood- 
ed Indian male children at all levels are suffi - 
ciently small to yield a meaningful third factor. 
Apparently, the third factor loadings for this group 
are error residuals. As aresult, only factorsI 
and II were used in oblique rotation for these 
three grade levels in which a single plane, two- 
dimensional rotationwas made. Tables VIII, XIX, 





RUPIPER 


TABLE II 


INTERCORRELATIONS BETWEEN THE RAW SCORES OF EACH TEST 
FOR FULL-BLOOD INDIAN CHILDREN BY SEX ON THE 
CALIFORNIA ACHIEVEMENT BATTERY 








Full-3lood Indian Children 





Males Females 
(N=1223) =1161) 
i= 2 3 4 
72) 56 761 69 | 54 
71 | 56 70 | 6&0 
73 70 71 
73 60 | 71 
68 | 5h 70 | 65 | 52 
65 | 48 66 | 65 | 53 


(N=972) (N=971) 
61] 51 | 64 62 | 54 | 47 
7. | 63 | 67 63 | 59 

78 | 70 5 63 71 

78 65 59 | 71 

70 65 2 64, 66 fo 8) 

56 | SO | 62 59 | Sh | U8 


(N=5 (N=62),) 
Advanced. 53 | 35 | 55 58 | 49 | 37 

61 | 52 | & 4l | 49 
70 63 61 72 
52 | 70 58 9 | 72 
60 | 63 | 58 él | 59 | 53 
60 | 52} 45 [5 57 | 48 | bé 


Note.--Decimals omitted for convenience. 




































































*Pest.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arithmetic 


Reasoning, . Arithmetic Fundamentals, 5. Language, 


6. Spelling 
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TABLE Ii 


INTERCORRELATIONS BETWEEN THE RAW SCORES OF EACH TEST 
FOR WHITE CHILDREN BY SEX ON THE CALIFORNIA 
ACHIEVEMENT BATTERY 








White Children 





Ma] 
2 3] L735 


76 | 75 67 
73 66 
73 65 
57 | 73 54 
67 | 68 
65 | 62 él 














= (N=] 
Inter- 76 | 67 67 
mediate 73 68 
73 68 
63 | 75 67 
68 | 68 : 
61 | 57 58 


a (N=1201) 
Advanced 72 | 63 | lg 59 
72 63 5k 71 62 61 
63 | 63 75 59 | 62 63 
49 | Su | 75 L6| 53 | 76 59 
58 | 58 | 67 | 58 61; 61] 63 
{61 | 53 | 53 | 3 62 | 56 | 18 64 





















































Note.--Decimals omitted for convenience. 


* Test.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arithmetic 


Reasoning, 4. Arithmetic Fundamentals, 5. Language, 


6. Spelling 
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TABLE IV 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FACTORS (F,) 
(MALE) 








Full-Blood Indian Children 


~ Male 
actor Loadings _ Factor Variances 
tg il iil a! | ii? {| iit! 
632] 123 | 036 692 | 015 | OOl1 
840 | 162 | 155 706 | 026 | 02h 
888 | -192 |-131 789 | 037 | 017 
737 | -438 | 131 543 | 192 | 017 
788 | 100 | 096 621 | 010 | 009 
285 558 | 081 | 082 
32h 637 | 105 | 002 
079 694 | 006 | 022 
-281 7h1 | 079 | OO 
-353 619 | 12k | 002 
-036 663 | 001 | OO4 
317 591 | 100 | O11 
h25, 575 | 161 | 023 
184 648 | 034 | OO 
=272 674 | O74 | 023 
-25 523 | 181 | 028 
-070 587 | 005 | 00 
203 |-075 523_| oki | 006 570 


Note.--Decimals omitted for convenience. 




















Advanced 

















1 
2 
3 
4 
5 
6 
1 
2 
3 
h 
5 
6 
I 
3 
> 
6 











2Test.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 
metic Reasoning, lh. Arithmetic Fundamentals, 5. Language, 


6. Spelling 
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TABLE V 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FAC TORS (Fo) 
(FEMALE) 








Full-Blood Indian Children 





Female 

Factor Loadings Factor Variances 
I Uu Iil It | it' | iiit 
630] 175] 104 659 | 031 | OU 
863 | 110 | -070 745 | 020 | 005 
859 | -236] 10h 738 | 056 | O11 
735 | -315 | 017 540 | 099 | 000 
793 | 166 | -122 629 | 028 | 015 
776 | 114 | 157 602 | 013 | 025 
7 26, | O78 563. | O81 | OOO 
795 | Ol | 098 632 | 002 | 010 
816 | -276 | -039 666 | 076 | 002 
756 | =333 | 020 572 | 111 | 000 
814 | -032 | -157 663 | 001 | 025 
752 | 268 | -118 566 | 072 | Olh 
675 | 256 | -167 U56 | 067 | Ooo 
782 | 176 | -143 12 | 031 | 020 
820 | =-326 | -08), 672 | 106 | 007 
729 | -360 | 1h3 531 | 130 | 020 
769 | 109] 108 591 | 012 | 012 
715 | 2hh |} 191 511 | 060 | 036 











wn 


Ovi 








MEW nw HAMew we 


~ 
































Note.--Decimals omitted for convenience. 


*Test.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 


metic Reasoning, . Arithmetic Fundamentals, 5. Language 


6. Spelling 
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TABLE VI 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FACTORS (Fo) 
(MALE) 








White Children 





Male 
Factor Loadings Factor Variances 
‘an: iil a? | Itt 7 iit 
“B59 | 1761 170 738 | 031 | 029 
850 | 166] 0°9 723 028 010 
887 | -213 | lL 787 | OS | 020 
739 | -32h | -085 Su6 | 105 | 007 
787 | 065 | -099 619 | 004 | O10 
748 | 139 | -113 560 | 019 | 013 
837] 334) 093 701 | 111 | 009 
864} 109 172 746 | 012 | 030 
856 | -226 | 159 733 | O51 | 025 
78h, | -360 | -026 615 | 130] OOl1 
80h | ~025 | -106 6&6 | 0OOl| O11 
739 | 12 | -199 546 | 020 | OO 
B06] 341] 207 650 | 116 |] 043 
787 | 203} 197 619 | Obl | 039 
855 | -283 | 128 731 | 080 | 016 
750 | -334 | 089 563 | 112] 008 
780 | O51 | -197 608 | 003 | 039 
699 | 167 | -197 489 | 028 | 039 














Intermediate 





Advanced 


2 
3 
4 
5 
4 
1 
2 
bs 
h 
5 
6 
1 
2 
3 
h 
5 
6 





























Note.--Decimals omitted for convenience. 
"Test.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 


metic Reasoning, 4. Arithmetic Fundamentals, 5. Language, 


6. Spelling. 
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TABLE Vil 


CENTROID FACTOR MATRICES AND OBTAINED COMMUNALITIES 
WITH PROPORTIONS OF VARIANCES CONTRIBUTED BY 
THE CENTROID FACTORS (Fp) 
(FEMALE) 








White Children 





Female 

Factor Loadings Factor Variances 
I il | Iii i! | Ir* | Iii? 
664 | 131 [-101 746 | 033 | O10 
866 | 078 | -068 750 | 006 | 005 
891 |-207 | 051 79 | 043 003 
764 |-29k | 135 58, | 086 | 018 
766 | 233 220 587 | 054 | 084 
756 | 164 |-017 572 | 027 | 000 
634 | 274 2 696 | 075 | 015 
844 | O91 | 191 712 | 008 | 036 
854 }=246 | 109 729 | O61 | 012 
815 |=301 | -05) 664, | 091 | 003 
811 | 009 | -068 658 | 000 | 005 
698 | 173 | -191 L487 | 030 | 036 
796 | 291 | -226 634 | O45 | 051 
798 132 | -129 637 017 017 
%2h |=357 | -097 679 | 127 | 009 
754 |=390 | 129 567 | 152 | 017 
796 | 072 | 19h 63 | 005 | 038 
728 | 271 | 152 530 | 073 | 023 














Intermediate 





Advanced 


























ANEW N HAM Ewen MaMewn 4 








Note.--Decimals omitted for convenience. 


*Test.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 
metic Reasoning, |. arithmetic Fundamentals, 5. Language, 


6. Spelling 





RUPIPER 


TABLE VIII 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, MALES, ELEMENTARY 
GRADE LEVEL GROUP 














Unrotated Factor Rotat ed Oblique Factor 
Matrix Fo Matrix V 








I ii - a B 
2032 ona el2k =.051 
. 8L0 162 463 7“. 014 
888 192 2152 | -.371 
2737 ~. 438 -.133 -.581 
- 788 100 2386 -.06), 
. 77 * 285 * 5L3 “~e 125 












































Matrix S= L. Direction Matrix/\. Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 








Factor A! BI Factor 
i ebO |-.2l a 
II 1,00 1.00 II 












































Matrix C =A’A, Cosines of Extended Factor 
Angles Between Reference Matrix Ep, 
Vectors A pq 








Factor A B De 
A 1.000 1.20192 
B 833 1.1908 
1.12613 
1.35685 
1.2690) 


®Test:--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 
































metic Reasoning, . Arithmetic Fundamentals, 5. Language, 


6. Spelling 
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TABLE Ix 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, MALES, INTERMEDIATE 
GRADE LEVEL GROUP 














Unrotated Factor Rotated Oblique Factor 
Matrix F, Matrix V 








i ii A B 
2195 | .32k e521 | .O8lL 
-333 079 229 | -.163 
-251 =.07 | -.516 
-787 | -.353 -.136 | =.56u 
-.036 2173 | -.268 




















317 507 | _.083 


























Matrix S = Le Direction Matrix A. Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 


to Each Coordinate Plane 








ctor A‘ BY Factor A B 
ms -230 z 2261 -.207 
i.00 1.00 II 966 2958 












































Matrix C =A’, Cosines of Extended Factor 
Angles Between Reference Matrix Eo 
Vectors A pq 


Factor A B De it iit 
A 1,000 225313 «406 -.95h 

B 851 1. 20048 2095 «179 
1.16144 -.326 O74 

1 . 27065 “« LL9 . 055 
1.22850 -.Oh) -.079 
|1.30039 e412 -.139 












































*Test.--1. Reading Vocabulary, 2. Keading Comprehension, 3. Arith- 


netic Reasoning, 4. Arithmetic Fundamentals, 5. Language, 


6. Spelling 
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TABLE X 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, MALES, ADVANCED 
GRADE LEVEL GROUP 














Unrotated Factor Rotated Oblique Factor 
Matrix F, Matrix V 








' il A 
(50 425 2647 
805 18), 433 
e821 | -.272 -007 
723 =-0425 -.170 
-766 | -.070 180 
.723 | .203 Ol 









































Matrix S=L. Direction Matrix A. Direction Cosines 
Coefficients of a Normal of Reference Vectors A p 
to Each Coordinate Plane 








Factor BY Factor 5 
I ) =236 I 2339 
IZ__|1.00_| 1.00 U 2941 












































Matrix C =AA, Cosines of Extended Factor 
Angles Between Reference Matrix E, 
Vectors A pq 








De it iit 
231926 1 e OL] 2199 
i. 222k) 1 0229 | -.075 
iL. 21803} 1 -.331 0184 

1 
1 
1.000 


A iB 
A 1.000 
B 785 


@ 
ct 


AN wn He 

















1. 38313 -.588 =. 230 
1. 30548 -.091 | -.078 
- 38313 e 281 -.104 




















®*Test.--1. Reading Vocabulary, 2. Reading Comprehension, 3. Arith- 
metic Reasoning, . Arithmetic Fundamentals. 5. Language, 


6. Spelling 
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TABLE XI 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN: CHILDREN, FEMALES, ELEMENTARY 
GRADE LEVEL GROUP 








Extended Factor 
Matrix Eo 





Il 











Rotated Oblique Factor 
Matrix V 





A B 





_De 
T. 2082 
1.15875 
1.1614 
1.3605h, 
1.26103 
1.28866 


I 
1.000 
1.000 
1.000 
1.000 
1.000 
1.000 


e2il 
-162 
=.275 
-.129 
«209 
147 


7189 
OL 
050 
=.063 
=.0h5 
216 


-.019 
025 
°390 
oLh6 

-.012 
2030 























Matrix S= L. Direction 
Coefficients of a Norml 
to Each Coordinate Plane 





At Bt Ct 

eld 1k 
-1,00 18 
- .02 -1.00 


Factor 
I OL 
I] 36 
III 1.00 


























Matrix C =/A\'A. Cosines of 
Angles Between Reference 
Vectors / pq 




















Matrix A. Direction Cosines 
of Reference Vectors Ap 





C 
elt 
«176 

-.975 


Factor 
=< 





A 
2036 
II 0339 
III -9L0 























Matrix T. Direction Co- 
sines of Primary 
Vectors Tp 





Factor 


A 


B 


= 





A 


RB 


1,000 
- © 345 








e .052 


=. 345 
1.000 
-.128 


-.552 
-.128 
1.000 











a 


2. Reading Comprehension, 3. 
metic Reasoning, 4. Arithmetic Fun- 
damentals, 


>Pactor A should read Factor C 
Factor C should read Factor A 


Test.--l. 


a 


Reading Vocabulary, 


5. Language, 6. Spelling 


Arith- 





A 


B 








0969 
0181 
168 





2946 
-.316 
-076 














Matrix TT' = Rpq. 
of Angles Between Primary 


Vectors Tp 


Cosines 





Factor 


‘B 





A 
B 
C 





A 
1.900 
872 
2964 














RUPIPER 


TABLE XII 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, FEMALES, INTERMEDIATE 
GRADE LEVEL GROUP 








Extended Factor 
Matrix Ep 





De __ ‘to: 
(1.33333 | 1. 0319 
1.25786 | 1. 2052 
1.2259 000 -.338 
1.32275 | 1.000}-.L0 
1.22850 | 1.000 }-.039 
1.32979 | 1.000} .356 





























Matrix S= L. Direction 
Coefficients of a Normal 
to Each Coordinate Plane 





Factor A‘ Bt cf 
I el2 236 209 
iI 226 -1. 221 
III 1.00 - .03 























Matrix C =A’'\, Cosines of 
Angles Between Reference 
Vectors /A pq 





Factor A B Cc 
A 0999 
B -.223 1.001 
Cc -.876 nen 136 1.000 

















®8Test.--l1. Reading Vocabulary, 
2. Reading Comprehension, 3. Arith- 
metic Reasoning, . Arithmetic Fun- 


damentals, 5. Language, 6. Spelling 








Rotated Oblique Factor 
Matrix V 





A _B 





e23e “*« 015 
+196 + 228 
075 537 
2023 | .569 
@e 065 . 310 
-OL0 006 























Matrix A . Direction Cosines 
of Reference Vectors Ap 





Factor mi 

T ~0b8 
II 2205 
III -.975 























Matrix T. Direction Co- 
sines of Primary 
Vectors T 





A B Cc 
e931 0913 e92l 
331 -. 108 337 
0153 -.00) -.197 























Matrix TT' = Rpq. Cosines 
of Angles Between Primary 
Vectors T 





tor A B 
1.000 
«714 | 1.000 
939 
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TABLE XIll 


MATRICES OF OBLIQUE ROTATION OF AXES FOR FULL-BLOOD 
INDIAN CHILDREN, FEMALES, ADVANCED 
GRADE LEVEL GROUP 














Extended Factor Rotated Oblique Factor 
Metrix E, Matrix V 


A B — G 
=. 009 =.030 
-.003 .075 
-.)09 055 

2085 «530 
°210 101 
2327 -.053 








rs 


Te T 
ry-TRILBT 1.0001 2382 
1.27877} 1.000] .225 
1.21951] 1.000], 298 
1.2717 | 1.000] -.b:9) 
1.30039 1.000 elli2 
1.29860} 1.000} .3L1 








ownew voedl 












































Matrix S= L. Direction Matrix A . Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 








Factor At BY ct Factor B Cc 
~T 210 230 219 I 2285 | 182 

I 235 | -1.00 222 II -.949 | .20 
mz __| 1.00 |~.2h | -2.00 ur | -.133 | -.960 












































Matrix C =A’\A . Cosines of Matrix T. Direction Co- 
Angles Between Reference sines of Primary 
Vectors /\pq Vectors Tp 








Factor _ is A i] 
A | 1.000 2940 2590 
n -.1:10 1.000 2 2h9 -.4:52 
C -.817 -.031 0999 2233 2069 









































*"Test.--l1. Reading Vocabulary, 





Matrix TT' = Rpq. Cosines 

2. Reading Comprehension, 3. Arith- of Angles Between Primary 
Vectors Tp 

metic Reasoning, . Arithmetic Fun- 





Factor A B 

damentals, 5. Language, 6. Spelling A 1.000 
B 
C 





- 740 
-904 











OFactor A should read Factor C 
Factor C should read Factor A 











RUPIPER 


TABLE XIV 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, MALES, ELEMENTARY 
GRADE LEVEL GROUP 














Extended Factor Rotated Oblique Factor 


Matrix E, Matrix V 








ie Tr A B C 

; LOL 3 20k 7209 ~| -.O0L | -.00h 
1.17647 2195 2229 -.005 -O2 
1.127h0 -.2h0 14h 377 =. 064 
1.35318 -.438 -.119 2423 -.0L9 
1.27065 083 .003 .050 208 
1.33690 186 011 -.030 231 





















































Matrix S=L. Direction Matrix A. Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plene 














Factor A OF Factor A B C 
I -10 13 i 2094 0165 0127 
II 036 ea9 II 2337 -.972 185 
Iil 1.00 1.00 III 937 0165 | -.975 






































Matrix C = AA. Cosines of Matrix T. Direction Co- 
Angles Between Reference sines of Primary 
Vectors pq Vectors Tp 








Factor| A B A B C 
A 1.000 2 958 e92L 919 
B = 2157 2999 2192 -.387 ell 
Cc - .839 =.320 1.000 162 2046 -.149 
































®Test.--l. Reading Vocabulary, 





2. Reading Comprehension, 3. Arith- Matrix TT' = Rpq. Cosines 
of Angles Between Primary 


metic Reasoning, ). Arithmetic Fun- Vectors Tp 





damentals, 5. Language, 6. Spelling Factor A B 
A 1.900 
B - 825 1.000 


Cc 2951 ~840 
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TABLE XV 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, MALES, INTERMEDIATE 
GRADE LEVEL GROUP 














Extended Factor Rotated Oblique Factor 
Matrix E, Matrix V 


De A B 
-L9L7 0399 e312 | 134 
1.15741 -126 0325 2106 
1.16822 -.264| . 138 06 
1.27551 -.459 -.098 450 
1.24378 e -.031 013 117 
1.35318 2192 2012 -.081 





















































Matrix S=*=L, Direction Matrix A. Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 














Factor A‘ BY ci Facto A B 
Zz okt lS ond I e1k3 2165 
II 62 -1.00 209 II 2522 -.916 

III 1.00 .k0 | -1.00 III ~8L2 +366 









































Matrix C =A. Cosines of Matrix T. Direction Co- 
Angles Between Reference sines of Primary 
Vectors Apq Vectors T 








Factor A > | A * 2 
A 1.000 «962 i 
B - 1h7 1.900 2232 -. 387 
Cc - 766 -.422 1.000 164 084 






































8Test.--1. Reading Vocabulary, 








2. Reading Comprehension, 3. Arith- Matrix TT' = Rpq. Cosines 
of Angles Between Primary 
metic Reasoning, 4. Arithmetic Fun- Vectors T 





damentals, 5. Language, 6. Spelling Factor A B CoC 
a 1.001 
B -807 
C 2923 




















RUPIPER 


TABLE XVI 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, MALES, ADVANCED 
GRADE LEVEL GROUP 








Extended Fector Rotated Oblique Factor 
Matrix E, Matrix V 








De I Il A B _o. 
1.24070 }1.000] .423 463 =.068 2015 
i «27065 000} .258 2372 -053 -.003 
1.16959 000 |=. 331 -037 099 -.008 
iL. 33333 000 -4h5 -.036 -510 -001 
1.28205 | 1.000} .065 2033 -075 350 
1.43062 |1.000] .239 2025 -.051 354 












































Matrix S= L. Direction Matrix A. Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 














Factor, A! BI ct Factor A 
a eld 22> 220 I 2120 
II 7h -1.00 18 II 

III 1.00 e32 -1.00 III 






































Matrix C =A’A. Cosines of Matrix T. Direction Co- 
Angles Between Reference sines of Primary 
Vectors A pq Vectors Tp 








Factor] A __B A B _C 
A 1.000 0921 7920 9 
B = 28h 1.900 2307 -.353 +156 
Cc - 2645 -.402 1.000 2239 121 -. 258 
































®Test.--1. Reading Vocabulary, 





2. Reading Comprehension, 3. Arith- Matrix TT' = Rpq. Cosines 
of Angles Between Primary 


metic Reasoning, . Arithmetic Fun- Vectors Tp 








damentals, 5. Language, 6. Spelling Factor A B 

A 1.000 
B 0775 1.900 
C 864, 
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TABLE XVII 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, FEMALES, ELEMENTARY 
GRADE LEVEL GROUP 








Extended Factor Rotated Oblique Factor 
Matrix E, Matrix V 








De 7 2 A B C 
eL5/ul 2209 =.117 ol 39 2930 2298 
1.15473 090 =.079 -.026 -.076 +247 
1.12233 -.232 .057 -.021 -385 -082 
1.30890 )}-.385 .177 -023 - 467 -.00 
1.30548 304 .287 298 -.00k -.020 


6 1|1.32275 00} .217 -.022 .052 | =.012 193 















































Matrix S = L. Direction Matrix A . Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 








Factor A BI Cc! Factor B 
rT 2Ol oak 2 20 a 196 
II 0 -1,00 -20 II -.934 

III | 1.00 232 =| =-1,00 Ill -299 






































Matrix C =AA , Cosines of Matrix T. Direction Co- 
Angles Between leference sines of Frimary 
Vectors A pq Vectors Tp 


Factor A b:] A B 
A 71.000 0932 2937 
B b 067 1.000 -273 -.327 
C b .821 -. 429 2999 - 2,0 .122 












































8Test.--1. Reading Vocabulary, 








2. Reading Comprehension, 3. Arith- Matrix TT' = Rpq. Cosines 
of Angles Between Primary 


metic Keasoning, l. Arithmetic Fun- Vectors Tp 





damentals, 5. Language, 6. Spelling Factor A B 
A 1.001 


>Pactor A should read Factor C B 613 1.000 
Factor C should read Factor A C Sub 























RUPIPER 


TABLE XVIII 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, FEMALES, INTERMEDIATE 
GRADE LEVEL GROUP 








Extended Factor Rotated Oblique Factor 
Matrix V 








A B C 
. 0325 =.09 2063 
1.1883 -108 2308 086 -.034 
1.17096 -. 288 089 «413 -.054 
h.. 22699 --369 ’ -.085 435 2079 
. 23305 e011 2037 -131 -182 
D .4,3267 24,8 -.014 -.067 +330 


il Test® 





















































Matrix S= L. Direction Matrix /| . Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 














Factor Af Br ct Factor A B Co 
I el3 019 Ly I 2116 0185 elke 
II l\9 -1.00 e31 II el:37 -.972 2293 
III 1.00 15 -1.00 III 892 ~1h6 -.946 
































Matrix C =A’A - Cosines of Matrix T. Direction Co- 


Angles Between Reference sines of Primary 
Vectors A pq Vectors Tp 


Factor A B Cc A B 
A 1.000 095u 0943 
2213 -.330 


B |= 183 1.000 
C + .699 | = .397 1.000 209 2039 


*Test. --l. Heading Vocabulary, 












































2. Reading Comprehension, 3. Arith- Matrix TT! = Rpq. Cosines 
of Angles Between Primary 


metic Reasoning, . Arithmetic Fun- Vectors T,, 
y . 





damentals, 5. Language, 6. Spelling k A 
0999 


837 
2914 
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TABLE XIx 


MATRICES OF OBLIQUE ROTATION OF AXES FOR WHITE 
CHILDREN, FEMALES, ADVANCED 
GRADE LEVEL GROUP 














Extended Factor Rotated Oblique Factor 
Matrix E, Matrix V 








i A B 
. +306 25h -.030 -.9 
1.25313 2165 O11 2975 
l. 21359 -.l33 -.112 2554 
1.32626 -.517 082 566 
1.25628 2090 2296 2129 
1.37363 372 311 | -.080 









































Matrix S= L. Direction Matrix A. Direction Cosines 
Coefficients of a Normal of Reference Vectors Ap 
to Each Coordinate Plane 














Factor At cr Factor A —&B Cc 
B e1l2 eld i e113 2252 e172 
lu 3h 226 II e320 -.968 2,8 

Ili [1.00 -1.00 III 2941 -.010 -.95) 
































Matrix C =/\’A. Cosines of Matrix T. Direction Co- 
Angles Between Reference sines of Primary 
Vectors Apq Vectors Tp 


A A B 
1.000 eIy2 0593 
- -291 | 1.000 02h3 | =0bh7 
al 2799 @ -187 1.001 2 233 205 


®Test.--l. Reading Vocabulary, 








B Cc 








Factor 
AD 





























2. Reading Comprehension, 3. Arith- Matrix TT' = Rpq. Cosines 
of Angles Between, Primary 
metic Reasoning, . Arithmetic Fun- Vectors Tp 





damentals, 5. Language, 6. Spelling Factor A B Cc 
a 1.001 

>Pactor A should read Factar C B - 743 0999 
Factor C should read Factor A C 2907 727 
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and X give the unrotated factor loadings, the ro- 
tated oblique factor loadings, the direction coeffi- 
cients of the normal to each co-ordinate plane, 
the transformation matrix, and the matrix of co- 
sines of the angular separations of the oblique 
axes for full-bl ooded Indian children at the ele- 
mentary, intermediate, and advanced levels, re- 
spectively. 

The direction coefficients of a normal to each 
co-ordinate plane, Matrix S, when normal ized, 
resulted in the transformation Matrix A. The col- 
umn numbers represent the direction cosines of 
the reference vectors with respectto the unrotat- 
ed factor Matrix Fo. Post-multiplying the unro- 
tated factor matrix by the transformation matrix 
yielded the rotated oblique factor Matrix V. The 
cosines of angles between thé reference vectors, 
Matrix C (intercorrelations)'were obtained by 
multiplying the matrix of direction cosines of ref- 
erence vectors by its transformation. 

After the oblique rotations were accomplished 
for all the other groups, the cosines of angles be- 
tween the reference vectors all hadnegative signs 
indicating a positive correlation between factors 
A and B, factors A andC, and factors B and C. 
The cosines between the reference vectors are 
shown in Matrix C of Tables Vill through XIX, and 
the cosines between the primary factors are shown 
in Matrix Rpg of Tables XI through XIX. The in- 
tercorrelations were computed by using 


T= DA-! and Rpg = DA-l(DA-1)' = TT 


where T is the inverse of Matrix A normalized or 
the factor matrix for the primary vectors and Rpgq 
is the correlation between the primary factors 
which is obtained by matrix multiplication of Ma- 
trix T by its transpose. The rotated oblique fac- 
tor Matrix V for these groups was determined by 
post-multiplying the unrotated factor Matrix Fo of 
Tables V, VI, and VII by the respective transfor- 
mation Matrix A of Tables XI through XIX. 

Matrix C in Tables VII through XIX show the 
intercorrelations between the reference vectors 
while Matrix Rpg in Tables XI through XIX show 
intercorrelations between the primary vectors. 
These correlations were all significantly differ- 
ent from zero. This indicates that the obtained 
factors were not independent and that a relation- 
ship existed among them. The highest correla- 
tion existed between factors A and C which had 
significant loadings on Tests 1, 2, 5, and 6. In 
all cases, Tests 3 and 4 had significant loadings 
of Factor B which was also significantly related 
to the other two factors. These relationships in- 
dicated the presence of a second-order factor 
common to all the tests. 

In order to facilitate comparison of the various 
groups, factor patterns of the rotated oblique fac- 
tor loadings on each of the six tests are present- 
ed in Table XX. Factor loadings of less than .20 





were not included in the pattern plan. The ele- 
mentary and advanced grade level groups for fe- 
males of both races showed patterns in a reverse 
direction. Permutated rotation was used to re- 
verse the pattern so that comparisons of the sets 
of factor patterns could be made more readily 
(55). Factors A and C in Matrix V in Tables XI, 
XII, XVII, and XIX, for these two grade level 
groups, were interchanged by post-multiplying by 
the permutation matrix. Correspondingly, the in- 
terchange in Matrix Rpg was obtained by pre-mul- 
tiplying and post-multiplying it by the permutation 
matrix which resulted in an interchange of ran 
andrcp- This reversal did not effect the values 
of the factor loadings. 

Two significant factors were present for full- 
blooded Indian male children at each grade level, 
while three significant factors were present for 
the other groups. 

Rather consistent patterns of factors were ev- 
ident among grade level groups for full-blooded 
Indian male children on all tests except 5 and 6. 
Test 5 changed from a Factor A toa Factor B 
loading at the intermediate and advanced levels, 
while Factor A vanished on Test 6 at the advanced 
level. Forfull-blooded Indian female children a 
rather similar pattern existed among the three 
grade level groups. The ‘‘staircase effect’’ was 
present at all levels with the elementary group 
showing a Factor C loading on Test 5 and no sig- 
nificant loading on Test 1. A similar pattern was 
noted at the intermediate level with Factor B load- 
ings on Tests 2 and 5. For white male and fe- 
male children at all grade levels, three groups of 
two tests each, that is, Tests 1 and 2, 3 and 4, 
and 5 and 6 had significant loadings except for fe- 
males on Test 5 at the intermediate level and on 
Test 6 at the elementary level. 

The largest factor loadings were evident for 
full-blooded Indian male children. In most cases 
the loadings tended to increase by grade level 
groups for each race and sex. The sets of factor 
patterns between race groups appearedto be more 
varied than between sex. This was noted espe- 
cially for females where the factors clustered in 
groups of two on the same tests at the various lev- 
els for white children. The same pattern was not- 
ed at the advanced level for full-blooded Indian 
children, but at the other two levels greater pat- 
tern variability was present. In relation to sex, 
the factor patterns were similar for white chil- 
dren, but they showed more inconsistency for the 
full-blooded Indian children. 


Ill. Interpretation 





Since the tests used in this study were all 
paper-and-pencil variety, and since the tests pur- 
ported to measure the basic educational skills, the 
nature of the factors were considered in view of 
educational achievement. Thislimitation was un- 
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desirable, but restrictive because no other meas- 


urements were available for the subjects studied. 


In view of the tests’ content, Factors A and C 
were identified as verbal factors and Factor B 
was identified as a numerical factor with a com- 
mon factor underlying all the tests, which might 
be identified as a Schooling factor. It would be 
inappropriate to attempt an interpretation of the 
abilities necessary to perform the tasks called 
for by the various tests. The tests with more 
verbal material, Reading vocabulary, Reading 
comprehension, Language, and Spelling did not 
remain clustered as well as did those tests with 
number material, Arithmetic reasoning, and 
Arithmetic fundamentals. 


Evaluation of Hypotheses2 





Hypothesis 1—A comparison between the mean 
raw scores computed by the critical ratio showed 
a significant difference, .05 level of confidence, 
on all tests for the three grade level groups in 
favor of white children. With regard to sex dif- 
ferences, the mean raw scores for female white 
children were significantly higher, .05 level of 
confidence, thanfor male white children with one 
exception: At the advanced grade level males 
surpassed females on the Arithmetic reasoning 
test. In the majority of cases, the mean raw 
scores were Significantly higher, .05 level of con- 
fidence, for full-blooded Indian female children. 
The mean differences in favor of full-blooded In- 
dian male children were on the Reading compre- 
hension and Arithmetic reasoning tests at the el- 
ementary level, Reading vocabulary, Reading 
comprehension, and Arithmetic fundamentals at 
the intermediate level, and Reading vocabulary 
and Reading comprehension at the advanced level. 

Hypothesis 2— Two factors were found for full- 
blooded Indian male children at the elementary, 
intermediate, and advanced grade levels. These 
were identified as a verbal anda numerical factor. 
The high correlations between these factors sug- 
gested asecond-order general factor. Three fac- 
tors were found for full-blooded Indian female 
children and for male and female white children 
at each of the three grade level groups. 

Hypothesis 3— The factor loadings ranged from 
.20 to .65 for full-blooded Indian children and 
from .21 to .57 for white children. More high 
loadings appeared for full-blooded Indian male 
children thanfor any other race-sex group. 

Hypothesis 4— The factor loadings on the read- 
ing tests were higher for full-blooded Indian male 
children than for white malechildren. Ingeneral, 
the differences inloadings were .10 or greater in 
favor of the full-blooded Indian children. A sim- 
ilar difference was evident on the language and 
spelling tests. 

Hypothesis 5—Full-blooded Indian male chil- 
dren showed higher loadings of a number factor 





on the arithmetic tests than did white male chil- 
dren. Generally, the size of the loadings of this 
factor increased with increase in grade level. 
For full-blooded Indian female children the load- 
ings were highest at the intermediate level and 
lowest at the elementary level, whereas, for the 
white female children the loadings were highest at 
the advanced level and approximately of the same 
magnitude at the elementary andintermediate 
level with moderate loadings. 

Hypothesis 6—The correlation matrices for all 
groups analyzed showed highly significant positive 
relationships. This suggested an underlying com- 
mon factor. The common factor was substantiat- 
ed by the significant intercorrelations existing be- 
tween the reference and the primary vectors. 


IV. Discussion 





There is little evidence regarding differences 
in sets offactor patterns and factor structures by 
race and sex for the elementary, intermediate, 
and advanced grade level groups. Although the 
results did not conform ideally to Garrett’s (25) 
differential hypothesis whereby abstract intelli- 
gence changes in organization from a general abil- 
ity to anorganized group of abilities or factors as 
age increases, the arithmetic tests showed rather 
consistent increases in factor loadings between 
the grade levels from the elementary to the ad- 
vanced level. A similar trend was noted for both 
full-blooded Indian and white children on the read- 
ing tests which was more pronounced for males. 
This evidence was based on cross-sectional rather 
than longitudinal data. When the correlations be- 
tween the reference vectors and the primary vec- 
tors are considered, in most cases the correla- 
tions decreased with increase in grade level. 
These correlations were significantly different 
from each other which indicated differentiation be- 
tween grade level groups. 

With regard to the interpretation of the factors 
it should be pointed out that with only six variables 
the reference frames would not be too stable, 
therefore, the factor loadings might be subject to 
change from sampleto sample. It was difficult to 
adequately establish the traces on the test projec- 
tions with only six tests and criterion tests with 
established factors for the subjects in this study 
were not available. 

The high positive correlations among the tests 
and among the primary vectors strongly suggest- 
ed the presence of a factor in the second-order 
domain. Apparently, this accounted for thelow 
loadings on thefirst-orderfactors. It may be as- 
sumed that the tests are rather complex inthat 
they not only involve a specific factor, but also 
the common factor defined by the tests. Tests 1, 
2,5, and 6 appeared to be more complex than 
Tests 3 and 4. The reading and language tests in- 
volved two factors in addition to the common fac- 
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tor. For full-blooded Indian male children the 
language test clustered with the arithmetic tests 
at the intermediate and advanced levels. Per- 
haps at these levels the three tests have some as- 
sociative characteristics. A similar trend was 
noted for full- bl ooded Indian female children at 
the intermediate level withthe reading compre- 
hension test also joining the cluster. The clus- 
tering of the reading comprehension and language 
tests with the arithmetic tests did not occur for 
the other groups. 

Since full-blooded Indian children and white 
children receive their education in American 
schools, it seems reasonable to assume that the 
Indian children shouldlearn the same basic skills 
as the white children. The results as shown in 
this study as well as inother studies cited herein, 
indicated that significant differences existed be- 
tween races in educational achievement. Appar- 
ently, factors other than school experiences play 
an important role. These factors could be ge- 
netic, environmental, and experiential. Accord- 
ing to French (21), the factors might readily be 
conceptualized as experiential factors since they 
were obtained from measures on achievement 
tests. French (21) named the common factor 
which involves skills acquired in school, e.g., 
fractions, decimals, arithmetic problems, and 
vocabulary tests, as Schooling. Thisfactor usu- 
ally appears in the second-order domain. He calls 
attention tothe interpretation of the general, Num- 
ber, and Verbal factors which are all quite dis- 
tinct, and the relation between the Verbal or 
Number factor has nothing to do with the actual 
mechanism of Verbal or Number ability. Itis 
perhaps an arbitrary requirement of American 
schools which is determined by differences in 
amount of formal education. The findings in this 
study seem to corroborate Fr ench’s interpreta- 
tion. 

Since the factor patterns and size of factor 
loadings are essentially the same for both racial 
groups by grade level and sex, it indicates that a 
large proportion of the content of the California 
Achievement Test Battery measures approx- 
imately the same basic educational skills at each 
grade level. This evidence of continuity of con- 
tent and the variability of mean achievement sug- 
gests that various other factors are interferring 
with good articulation between elementary and in- 
termediate and between intermediate and ad- 
vanced grade levels for the two racial groups. 

The significant differences in mean achieve- 
ment and the slight discrepancies in the sets of 
factor patterns between full-blooded Indian and 
white children might be speculated from a theo- 
retical viewpoint. The groups studied were rea- 
sonably comparable with respect to geographical 
location and educational grade placement. As- 
suming that these conditions were sufficiently 
controlled, it seems that other factors would ac- 





count for the differences obtained. Americancul- 
ture perhaps determines very largely that both In- 
dian children and white children develop basic ed- 
ucational skills since these skills are important to 
both races outside an educational setting. How- 
ever, the two races undoubtedly place different 
values on these fundamental skills even though they 
both function inour American society. Because of 
a difference in the interaction between the indi- 
vidual’s educational environment and his cultural 
milieu it seems reasonable to assume that Indian 
children are not as cognizant of the ultimate need 
for these basic educational skills. Other than cul- 
tural differences, slow work habits, the nature of 
the test items, poor motivation, level of aspira- 
tion, and lack of interest may also account for 
some of the differences encountered. The ques- 
tion also arises as to whether the tests were equal- 
ly fair to both groups. Undoubtedly, these condi- 
tions could have effected the results of this study. 


we Summary 


The University of Kansas had conducted a fall 
achievement testing program for Indian and white 
children at the elementary, intermediate, and ad- 
vanced grade levels for the United States Bureau 
of Indian Affairs during the period 1951-55. The 
California Achievement Test Batteries were used 
to obtain scores of educational achievement in the 
following basic skills: reading vocabulary, reading 
comprehension, arithmetic reasoning, arithmetic 
fundamentals, language, and spelling. When the 
mean scores of these tests were compared as to 
race, significant differences were found in favor 
of white children. 

The sample was limited to all full-blooded In- 
dian children and white children on whom a com- 
plete set of test scores was available. The distri- 
bution of the 14, 888 subjects used in this study 
consisted of 5502 full-blooded Indian children and 
9386 white children. The geographical 1 oc ations 
of the administrative areas of the United States 
Bureau of Indian Affairs in which the testing was 
conducted were: 1) Phoenix, 2) Albuquerque, 3) 
Aberdeen, 4) Billings, 5) Muskogee, and 6) Ana- 
darko 

Pearson product-moment correlations between 
the tests were computedfor the various grade lev- 
el groups by race and sex. The twelve intercor- 
relation matrices were factor analyzed by Thur- 
stone’s centroid method and the factors were sub- 
jected to oblique rotation by means of extended 
vectors in accordance with the concept of simple 
structure. Three factors resulted. With oblique 
rotation, the third factor was not significant for 
full-blooded Indian male children. Correlations 
between the reference vectors and the primary 
vectors were obtained. Factors A andC were 
identified as verbal factors and Factor B as a nu- 
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merical factor with highly significant correlations 
between the factors. This indicated a second- 
order factor common to all tests. 

The purpose of this study was to compare the 
sets of factor patterns and factor structures by 
race, sex, and grade levels. Conclusions from 
the factor analyses follow: 


1. Two factors were found for full-blooded In- 
dian male children at each of the three grade lev- 
el groups, while three factors were found at each 
grade level for full-blooded Indian female children 
and for male and female white children. 

2. Higher factor loadings were found for full- 
blooded Indian male children than for any other 
race-sex group. The factor loadings ranged from 
-20 to .65 for full-blooded Indian children and from 
.21 to .57 for white children. 

3. In most cases the factor loadings of a verb- 
al nature were higher for full-blooded Indian 
male children. 

4. In general, the factor loadings of a numer- 
ical nature were higher for full - blooded Indian 
children except on the arithmetic reasoning test 
where white female children at the advanced level 
had the highest loading. 

5. The sets of factor patterns for all groups 
were more similar than they were different. 

6. Acommon factor of the second-order do- 
main was evident throughout all groups. This 


factor is probably Schooling. 


FOOTNOTES 


* This monograph is based on the author’s dis- 
sertation completed at the University of Kansas. 
The data used in this study were obtained from 
the Indian Education Research Project conduct- 
ed by the United States Bureau of Indian Affairs 
and the University of Kansas during the period 
1951-55. Mr. L. M. Coombs, Education Spe- 
cialist, United States Bureauof Indian Affairs, 
offered suggestions which were essential to the 
study. Dr. K. E. Anderson and Dr. A. H. 
Turney, Committee Members, read and eval- 
uated the manuscript. Dr. R. N. Bradt, De- 
partment of Mathematics, provided helpful ad- 
vice on the statistical and factorial analyses. 
Mr. C. Bradrick, Director of the Statistical 
Bureau, compiled the data for computation of 
the correlations. The author is indebted to 
these persons for their cooperation and assis- 
tance. The author is deeply indebted to his 
major advisor, Dr. E. Gordon Collister, not 
only for his excellent direction and advice, but 
also for his encouragement and inspiration. 


** Now at the College of Education, University of 
Oklahoma. 





203 


. The writer is indebted tothe staff of the Statis- 


tical Bureau, University of Kansas, for carry- 
ing out the necessary machine operations for 
these factor analyses. 


. The following scale was used for the evaluation 


of factor loadings: 


. 00 to .20 
. 20 to .30 
. 30 to . 50 
. 50 and above 


Insignificant loadings 
Low loadings 
Moderate loadings 
High loadings 
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THE DEVELOPMENT of materials for helping 
the child become sensitive to the dynamics of be- 
havior has been one of the aims of the Preventive 
Psychiatry Research Program in the Child Wel- 
fare Research Station at the State University of 
Iowa. Among the material prepared is a group of 
narratives designed to help the kindergarten child 
develop a causal orientation toward his social en- 
vironment. If these stories are to be an effective 
medium for introducing the young child to a caus- 
al way of thinking, it seems necessary that the 
children should enjoy listening to them. The pri- 
mary purpose of this study, therefore, was to 
discover how well kindergarten children liked to 
listen to and discuss causally oriented narratives 
specially written for their age. The statistical 
procedure chosen to determine this was the meth- 
od of paired comparisons. 

The purpose of the causally oriented story is 
to introduce the child to the differences between a 
non-causal or surface approach and a causal ap- 
proach to human Situations. In the narrative, a 
social situation is presented and someone begins 
to act in a non-causal way. The problem is then 
re-examined and the individual proceeds more 
causally after which the effects of this type of ac- 
tion are considered The reading of the story is 
accompanied by a discussion of differences be- 
tween the two approaches and some of the probable 
effects of each. 


Description of the Investigation 





During the school year 1954-55, a number of 
teachers participated in the Preventive Psychia- 
try Program (5) by presenting causally oriented 
materials totheirclasses. The teachers had tak- 
en part in an intensified training program through 
summer workshops set up to train them to prac- 
tice and teach the dynamic approach. The pupils 
in two of the kindergarten groups were chosen as 








subjects for this study. Class A consisted of 41 
children in the morning and afternoon classes of 
teacher A, and Class B of 61 pupils in the morn- 
ing and afternoon classes of teacher B. 

In the method of paired comparisons, the sub- 
ject is presented a number of different stimuli in 
pairs and is asked to judge which is greater with 
respect to some characteristic, such as, which of 
two objects is heavier, which is prettier, or which 
of two activities he prefers. In this particular in- 
vestigation each child was asked which of two kin- 
dergarten activities he preferred. By statistical 
analysis of paired comparisons it is possible to 
assign a set of numbers to the different stimuli 
the subjects have been asked to judge. 

Six different kindergarten activities were stud- 
ied, namely: having music, working puzzles, hav- 
ing the teacher read ordinary or regular kinder- 
garten stories, playing house, painting at the easel, 
and having the teacher read the causally oriented 
primary stories. These kindergarten activities 
were combined in all possible pairs. Each child 
was taken privately by a trained person! and asked 
which of each pair of activities he preferred. 

The first few minutes with each child was spent 
in establishing rapport and seeing if the child un- 
derstood the difference bet ween the two types of 
stories being used as stimuli. The regular kin- 
dergarten stories, often read by the teacher, con- 
sisted of a pile of several books from the book ta- 
ble. The causally orientedstories were designat- 
ed as the book without any pictures from which the 
teacher read to them; a copy of this book was on 
the table so that the child could recall it. A final 
check was made by asking the child if he knew the 
difference between the stories in this book without - 
any pictures in it andthe stories here—pointing to 
the books from the table. The administrator did 
not let the child see the books from the table other 
than the stack as the child was to respond to the 
books from the library table rather than to specific 


* The preparation of this paper was supported by the Grant Foundation. 
** Address: Preventive Psychiatry Research Program, Child Welfare Research Station. 
1. Mrs. Helen Marie Clark, Elementary Supervisor for the Preventive Psychiatry Program, 1955-56, 


administered the test. 
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stories, and, likewise, the book containing the 
causally oriented stories was only shown to the 
child without permitting him to leaf through it. 
The presentation of the pairs of stimuli were 
presented in such a manner as to adhere to the fol- 
lowing principles as closely as possible (10, 11): 


1. Maintain the greatest possible spacing be- 
tween pairs involving any given member of 
the stimulus group. 


Present a stimulus equally often first and 
second. This is referred to as taking ac- 
count of ‘‘space errors.’’ 


. Present stimuli as nearly in the same man- 
ner as is possible so as to avoid suggestion 
effects. 


In order to insure the fullfillment of the third 
principle, the questionfor the presentation of each 
pair of objects was carefully planned and written 
in the instructions. Examples of these questions 
were: 


. Which would yourather do, listen to a stery 
from this book (the causally oriented stories) 
or listen to a story from this book (regular 
stories)? (In each case only pointing to the 
book or books. ) 


2. Which would youratherdo, have music time 
when you sing or work with a puzzle? 


3. Which would you rather do, play house or 
paint at the easel ? 


Analyses of the Data 





The first step in the analysis of preferences of 
the six kindergarten activities used in this study 
was the preparation of a proportion matrix show- 
ing the proportion of times each activity was pre- 
ferred over its paired activity. Table I and Table 
Il give the proportion matrices for the children in 
Classes A and B respectively. In these matrices 
each stimulus at the top is judged greater than 
each stimulus onthe side by the corresponding 
proportion. For example, in Table I the propor- 
tion of the time ‘‘Music’’ was preferredover ‘‘Ea- 
sel Painting’ is .463. In Table II the proportion 
of time ‘Playing House’’ was preferred over 
‘*Regular Stories’’ is . 623. 

The analysis of the proportion matrices in Ta- 
bles I and II is based upon the application of Thur- 
stone’s law of comparative judgment (13). This 
law is stated in the following form: 





Rj - Ry Z jk Y sj + si - 2r jKSjSk 





where Rj and Ry, = mean psychological val ues 
characteristically attached to stimuli Sj 
and S, respectively, 

Zjk = standard measure distance or deviate from 
the mean of a unit normal distribution, 

Sj and s, = standard deviations of distributions 
Rhj and Rpk respectively, and 

rjk = coefficient of correlationbetween Rpj and 
Rhk- 


The radical term is the standard deviation of the 
differences Rhj - Rhk is expressed as a function of 
the terms under the radical. 

In Thurstone’s Case III, he assumes rj, = 0, 
that there is no correlation between responses to 
any pair of stimuli. Under this assumption the 
law of comparative judgment reduces to the form 


Rj @ Ry, = ZjkV " + Sk 


Scale-values in terms of preference for the six 
activities as judged by the children in Class A and 
Class B were analyzed under the assumption of 
Thurstone’s Case III. This was done for Class A 
and Class B separately and for the combined 
group. 

We find the zj,x values from a knowledge of the 
experimental proportions Pj>k- The Zjk is the z- 
value, given ina table of the normal curve, for 
the respective proportion, consider the proportion 
pj>k a8 thearea under the normal curve. When 
the proportion p is greater than .500 it’s z-value 
is positive; when p isless than .500 it’s z-value is 
negative. 

Table III gives the zj, values for the respective 
proportions obtainedfor Class A as given in Table 
I. The sum column in Table Ilis used as a means 
of checking the computations. The sum of the 
sums should equal zero, and the sum of the col- 
umn means should also equal zero. 

Burros (1) proposed some convenient formulas 
for estimating the relative values for the standard 
deviations: 


5§5-=- 

where sj = the standard deviation for the dis- 
persion of stimulus §; 

Vz. = the variance of deviations zj, in each col- 
umn of Matrix Zs from the column mean, 
Mz. these values are found in Table II for 


Class A. 
c is estimated by the formula 





TABLE I 


PROPORTION MATRIX FOR THE SIX KINDERGARTEN ACTIVITIES IN TERMS OF PREFERENCE 
AS JUDGED BY THE FORTY-ONE PUPILS IN CLASS A* (MATRIX Pa) 





Kindergarten Regular Playing Working Easel Causal 
Activities Stories House Puzzles Painting Stories 





Regular Stories . 500 ‘ . 561 . 537 . 610 . 902 
Music . 439 . . 707 . 561 . 537 . 732 
Playing House . 439 . . 500 . 659 . 561 . 780 
Working Puzzles - 463 : . 341 . 500 . 659 - 683 
Easel Painting . 390 , . 439 . 341 . 500 . 659 
Causal Stories . 098 : . 220 . 317 . 341 . 500 


=Pj >k 2.329 ; 2. 768 2.915 3. 208 4. 256 





*Kindergarten Activities have been arranged in order of increasing =pj> k 


TABLE I 


PROPORTION MATRIX FOR THE SIX KINDERGARTEN ACTIVITIES IN TERMS OF PREFERENCE 
AS JUDGED BY THE SIXTY-ONE PUPILS IN CLASS B* (MATRIX Pp) 





Kindergarten Regular Playing Working Easel Causal 
Activities Stories Music House Puzzles Painting Stories 





Regular Stories . 500 . 574 . 623 . 656 . 787 . 918 
Music - 426 . 500 . 459 . 459 . 656 . 705 
Playing House . 377 . 541 . 500 . 607 . 525 . 623 
Working Puzzles . 344 . 541 .393 , . 500 . 627 . 623 
Easel Painting . 213 . 344 . 475 . 328 . 500 . 443 
Causal Stories . 082 . 295 . 377 ~ 3tT . 557 . 500 


EPj> k 1.942 2.795 2. 827 2. 927 3. 697 3. 812 





*Kindergarten Activities have been arranged in order of increasing =Ppj > k across the columns. 
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where n is the numberof stimuli, and Vz; is 
defined as above. 


Substituting the zj, and Mz jx values in Table 
Il] in the above formulas we can estimate the 
standard deviations for the six kindergarten activ- 
ities as judged by the forty-one children in Class 
A. Table IV gives the statistics needed for com- 
puting Burro’s formula for the Class A group. 

The sum of the reciprocals of the variances is 
55. 379 and the number of stimuli is six, giving a 
c-value of .1083. As a check on the computations 
the Sj values should sum ton, the number of stim- 
uli. This is due tothe fact that this method as- 
sures a mean sj; of 1.000. 

Table V contains the (Rj - Rx) values which 
are computed according to the assumptions of 


Thurstone’s Case Ill, Rj - Rk = Zjx Sj + Si and 


the Rj scale values are based upon these assump- 
tions. The Rj value is a method of arbitrarily 
assigning the zero point to the lowest mean and 
adding the amount necessary to make the lowest 
mean .000 to eachof the mean values. Either the 
mean values or the Rj values can be taken as 
scale values for the different kindergarten activ- 
ities. Guilford (2) presents a method of locating 
a psychological meaningful zero point; however, 
this requires addtional data. 

The sum column in Table V is used as a means 
of checking the computations inthe table, the sum 
of the sums should equal zero, the sum of the col- 
umns means should also equal or approximate 
zero, the sum of the Rj’s should equal n times the 
number added to each mean Mz jx, or .353n, where 
n is the number of stimuli. 

Given the scale values in Table V, thenext 
problem is to determine how well the observed 
proportion of pupils’ judgment agree with the ex- 
pected proportion as determined by the above scale 
values. This test of internal consistency is made 
by utilizing Mosteller’s method (8) by whicha 
single chi-sqare test is made for the obtained 
and expected proportions. The first stepis to find 
the expected z'j, values. These are obtained by 
applying the following formula: 

‘ Rj - Rk 

Z jk = ———— 

Vv SS + s- 

J k 
where the Rj - Ry, values inthe numerator are ob- 
tained from the interpair scale differences given 
in the Rj row in Table V. Thedenominator is the 
square root of the sum of the paired sj’s in the 
last row of Table IV. These z';, values are giv- 
en in Table VI. Only half the matrix is needed 
since the twohalves are identical in absolute val- 
ues. The diagonal cells have been ignored as 


there are no actual experimentally obtained values. 


The expected proportions p'’, given in Table 
Vil, are the areas under the normal curve for the 





respective z' j, values in Table VI. If z' should 
be negative the area under the smaller portion of 
the curve is used, whereas when z' is positive the 
area under the larger portion is used. 

The test for internal consistency as proposed 
by Mosteller (8) does not use the actual p and p' 
values since many of the proportions might be 
close to 1.00, whichis the region where sampling 
distributions of proportions are not normal even 
for reasonably large samples. In order to com- 
pensate for this lack of normality, each propor- 
tion is transformed to a statistic 8, whose sam- 
pling distribution is normal. Theta being the 
angle whose sine is the Vp, or @ =arcsin Vp. Ta- 
bles VIII and IX give the @-values for the obtained 
and experimental proportions respectively. The 
chi-square form applicable for this test of intern- 
al consistency is 

hi o— = - Q' )? 
chi-square 321 =(8 - 8") 

where N is the number of judgments per stimulus 
pair, 9 is the arcsin Vp and @' is the arcsin vp’. 
Mosteller (8) gives the degree of freedom for the 
above chi-square as 


df = (n - 1)(n - 4) 
—,—— 


where n is the number of stimuli. Thesum of the 
squared differences between the observed and ex- 
pected theta is 259.6927, andthe number of pupils 
is forty-one; therefore 


Chi-square = 


41(259.6927) ar _ 5 
~ 6al i 


Chi-square 12.969 


Since the chi-square is not significant at the 
one percent level one could assume internal con- 
sistency between the observed and expected pro- 
portions for the preference of the six kindergar - 
ten activities as judged by the forty-one children 
in Class A. The scale-values for these activities 
are the Rj values given in Table V, namely: Reg- 
ular Stories .000, Music .017, Playing House 
.259, Working Puzzles .266, Easel Painting . 524, 
and Causal Stories .1.053. The zero point is ar- 
bitrarily assigned. 

Analysis of the preference of the six kinder- 
garten activities for Class B, using Thurstone’s 
Case III assumptions yielded slightly different re- 
sults as obtained for Class A. Table II gives the 
proportion of time each activity was preferred 
over its respective pair. The transformation of 


these proportions into Rj - Rx values where R; - 
Ry = Zjk sf + Sf and the resultant scale - values 


are presented in Table X. Testing of the obtained 
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TABLE VI 


EXPEC TED SCALE SEPARATIONS z' BETWEEN THE SIX KINDERGARTEN ACTIVITIES DERIVED 
FROM SCALE VALUES OBTAINED BY THE ASSUMPTIONS OF CASE III (MATRIX Z' 4) 





Kindergarten Playing Working Easel Causal 
Activities Si House Puzzles Painting Stories 





Regular Activities ‘ . 308 .214 . 305 1. 167 
Music . 160 . 141 . 238 . 671 
Playing House . 006 . 152 . 844 
Working Puzzles .131 . 599 
Easel Painting . 299 





TABLE VII 


EXPECTED PROPORTIONS p' DERIVED THROUGH z' FROM SCALE VALUES FOR SIX KINDER- 
GARTEN ACTIVITIES, AS JUDGED BY PUPILS IN CLASS A, UNDER THE ASSUMPTION 
OF CASE II (MATRIX P" a) 





Kindergarten Playing Working Easel Causal 
Activities House Puzzles Painting Stories 





Regular Stories ‘ . 621 . 585 . 620 . 878 
Music . 564 . 556 . 594 . 749 
Playing House . 502 . 560 . 801 
Working Puzzles . 552 . 725 
Easel Painting . 618 
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TABLE VIII 


OBTAINED PROPORTIONS FOR THE SIX KINDERGARTEN ACTIVITIES CONVERTED INTO ANGLES 
IN TERMS OF DEGREES (@) BY MEANS OF THE FUNCTION @ = ARCSIN Vp (MATRIX @) 








Kindergarten Playing Working Easel Causal 
Activities House Puzzles Painting Stories 





Regular Stories 48. 50 47.12 51.35 71.76 
Music 57. 23 48.50 47.12 58.82 
Playing House 54. 27 48. 50 62.03 
Working Puzzles 54. 27 55. 73 
Easel Painting 54. 27 





6 values for Class A. 


TABLE IX 


EXPECTED PROPORTIONS FOR THE SIX KINDERGARTEN ACTIVITIES CONVERTED INTO ANGLES 
IN TERMS OF DEGREES (6') BY MEANS OF THE FUNCTION @' = ARCSIN vp’ (MATRIX @° ) 








Kindergarten Playing Working Easel Causal 
Activities sic House Puzzles Painting Stories 





Regular Stories + > 52.00 49.89 51. 94 69. 56 
Music 48. 68 48.22 50. 42 59. 93 
Playing House 45.11 48.45 63.51 
Working Puzzles 47.98 58.37 
Easel Painting 51.83 


8’ values for Class A. 
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and expected proportions by Mosteller’s method 
showed internal consistency of the data at the one 
percent level, the level chosen to reject the hy- 
pothesis. The scale-values computed for the six 
kindergarten activities as derived by means of 
paired comparisons and judged by the sixty -one 
pupils inClass B were Regular Stories .000, Play- 
ing House .333, Music .389, Working Puzzles 
-481, Causal Stories .860, and Easel Painting 
-931. Two pairs of activities reversed position 
when compared with scale-values for the kinder- 
garten activities as judged by the children in Class 
A, namely, Music and Playing House, and Easel 
Painting and the Causal Stories. 

As another part of the analysis of the investi- 
gation, the data for Class A and Class B were 
pooled and scale-values for the combined group 
were calculated. Table XI gives the obtained pro- 
portions for the combined group. 

Thurstone’s Case II] assumptions were applied 
to the proportions obtained by pooling the data. 
The Rj - Rx values under these assumptions are 
presented in Table XII plus the scale-values for 
each of the kindergarten activities. 

The chi-square test for internal consistency 
was not significant at the one percent level. The 
scale-values thus obtained by Thurstone’s Case 
Ill assumptions for the combined classes were 
Regular Stories .000, Music .188, Playing House 
.305, Working Puzzles .366, Easel Painting . 816, 
and Causal Stories .894. The zero point is an 
arbitrarily assigned value. 


Conclusions and Summary 





The purpose of this experiment was to find out 
how well kindergarten children liked to listen to 
the causally oriented stories specially written for 
their age. Subjects were forty-one kindergarten 
children in Class Aand sixty-one in Class B. 
Thurstone’s Case II] assumptions were used. Af- 
ter comparing this activity withfive other kinder- 
garten activities by means of paired comparisons, 
the results showed that listening to the prepared 
causal stories was preferred by Class A and the 
combined group over the other five kindergarten 
activities, namely, having Music, Working Puz- 
zles, Painting at the Easel, Playing House, or 
Having Regular Kindergarten Stories read to 
them. The significance of internal consistency 
was tested by using the one percentlevel of rejec- 
tion. A similar analysis of the paired compari- 
son data for Class Bshowed a reversal in the 
preference of Causal Stories and Easel Painting. 
Easel Painting rated first and causal narratives, 
second. 

Thus, using the materials designed to help the 
child develop a causal orientation toward his so- 
cial environment ranked well with other activities 
included in this investigation. 
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Table XII is a summary of the scale values 
assigned to the six kindergarten activities as 
judged by pupils in Class A, Class B, and the 
Combined Classes A and B, using Thurstone’s 
Case III assumptions for scaling the paired com- 
parison data. 
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ANALYSIS OF MMPI DATA FOR CLASSIFICA- 
TION PURPOSES BY MULTIVARIATE 
STATISTICAL TECHNIQUES’ 


PETER P. REMPEL 
State University of Iowa 


THE PURPOSE of this paper is to present the 
steps involved in the application of multivariate 
analysis tothe assignment—witha minimum error 
of classification—of N people to q criterion groups 
on the basis of all information provided by k in- 
tercorrelated measures. 

The study itself stems from the writer’s at- 
tempt to identify as potential delinquents or non- 
delinquents (q = 2), a population of ninth-grade 
high-school boys (N= 1802) to whom the MMPI had 
been administered by Hathaway (3) in 1947-1948, 
and for whom police and/or court records were 
available through 1952. Severity of misdemeanor 
made possible a classification of delinquency into 
eight levels. Because of inter-group similarity 
of MMPI profiles between nondelinquents and de- 
linquents at levels one andtwo, the latter (N = 363) 
have been omitted from this study (see Table I). 

Variables used in one part of the study, re- 
ferred to as the main study, consisted of the 
scores on the F, Hs, Sc, Mf, and Pd scales of the 
MMPI (k = 5); in another part, the school study, 
they consisted of the Ma, Sc, Mf, and Pd scales 
of the MMPI; honor point ratio, and numbers of 
days absent in Grade VIII (k = 6). 

In the main study the MMPI profiles of 351 boys 
who had achieved delinquency status by 1952 were 
divided by random procedures into two groups: 
Sample A (N = 175) which was used in the develop- 
ment of the classification formula, and Sample B 
(N = 176), set aside for cross-validation pur- 
poses. Two samples of 175 profiles each were 
similarly selected from the nondelinquent set of 
MMPI profiles. Inthe school study both delin- 
quent and nondelinquent groups were divided in- 
to Samples A' and B' andfour sub-groups (N = 91) 
were thus similarly obtained. 


Tests of Assumption 





Numerical values obtained for the statistics Bit 
and £2, —the former used as ameasure of skewness, 





the latter as a measure of kurtosis—indicated that 
roughly the distributions of only half of the vari- 
ables tested in either study (in Samples A and A’ ) 
satisfied the assumption of normality. Somewhat 
more satisfying results were obtained from tests 
of homogeneity of variances andcovariances. Thus 
approximately 30 percent of the tests indicated 
lack of homogeneity of variances ata statistically 
significant level, and 20 percent indicated lack of 
homogeneity of covariances. The effect of non- 
normality upon the distribution of D* and the LDF 
statistic is not known. Atest of the assumption of 
linearity of regression (4:240-245) was omitted be- 
cause the number of variables involved made the 
amount of labor required too excessive. Inspec- 
tion of correlation charts of selected pairs of var- 
iables made it appear unlikely that anything other 
than linearity existed. Because cross -validation 
has been included, the application of multivariate 
analysis was thought to be justified even though not 
all of the assumptions have been fully met. But in 
drawing conclusions from this study, one must 
recognize the fact that not all of the assumptions 
for the application of multivariate procedures were 
fulfilled. 


Discussion of Statistical Techniques 








The generalized distance function contr ibutes 
both to the problem of testing significance and of 
estimation. The five (or six) variables used in the 
main andschool studies respectively, may be con- 
sidered as defining a space of five (or six) dimen- 
sions so that any individual with assigned scores 
can be represented by a pointin such a space. The 
individuals belonging to any group will then be rep- 
resented by a cluster of points round another point 
which represents the mean values of these scores. 
If the mean values of the two groups are close to- 
gether, then the clusters of points corresponding 
to them will overlap to some extent. Rao (6) has 
shown that the degree of overlap between any two 


*Submitted to the Graduate School of the University of Minnesota in partial fulfillment of the require- 


ments for the Ph. D. degree. 
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TABLE I 


DISTRIBUTION OF VALID MMPI PROFILES BY DELINQUENTS, LEVEL OF DELIN- 
QUENCY, AND NON-DELINQUENTS 





Percent Percent 
(Based on (Based on 
N = 1802) N = 714) 


Classification Number 








All Delinquents 714 39. 6% 


Levels 1 and 2 36: 20. 
P a 


Level 3 3 
Level 4 
Level § 3 
Level 6 2% 
Level l 
Level 


Total Level 3-8 
Non-delinquents 


Total (All delinquents 
and non-delinquents) 








TABLE I 


A SUMMARY COMPARISON OF CLASSIFICATION EFFICIENCY FOR THE LDF PROCEDURE 
WITH CHANCE EXPECTATION FOR SAMPLES A AND BIN THE MAIN STUDY BY DELIN- 
QUENT AND NON-DELINQUENT GROUPS WITHIN THE SAMPLE 





Correct Classi- Correct Classi- Percentage 
fication by fication by Improvement 
Formula Chance Over Chance 


Samples Used N q N % 








Sample A - Delinquents 175 2 68. 87. 

Sample A - Non-delinquents = 175 + 65. 87. 
Total Sample A 350 235 67. 

Sample B - Delinquents 176 70. 88. 

Sample B - Non-delinquents = 175 : 58. 87. 
Total Sample B = 351 2 64. 175. 


All Groups Total = 701 5 65. 








* Significant at the 5 percent level 
**Significant at the 1 percent level 
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such clusters can be measured by the quantity 


2 i=5(6) j=5(6) ij 
D* = | >a did 
i=1 j=1 


where djd9;..., d5 (or dg) are the differences 
between the mean values of the measurements in 
the two groups, and a) are the elements of the 
matrix reciprocal to the within group dispersion 
matrix (ajj). The higher the value of D’, the 
smaller the amount of overlap, so that this meas- 
ure may be regarded as the ‘‘distance’’ between 
the groups. 

The calculation of D? is giveninthe basic form 
as 
2 


D 


= 2j2j said; (1) 


S= elements of the matrix inverse to Sjj, the 
dispersion or within groups variance-co- 
variance matrix. 


oe = 


dj = Xj - Xj 
dj = Xj - X; 


where Xj; represents the sample mean of the ith 
variable and Xj is the sample mean of a second 
category of the ith variable. 

The ‘‘best’’ (unbiased) estimate of the sample 
variance used in the matrix fcr the ith test is 


Nx 2 — Ny 2 — 
= Xn -Nxk, +2 Yuu - NY? (2) 
kel ik x44j ke1 ik i 


and the unbiased estimate of the covariance be- 
comes 


the random variable de- 
fined by the ith test (or 
scale) in the first group, 

the random variable de- 
fined by the ith test (or 
scale) in the second group, 
the score of the kth indi- 
vidual in the first group 
on the ith test. 

the score of the kth indi- 
vidual inthe second group 





on the ith test, 

the sample mean of the ith 
test in the firstgroup, and 
the sample mean of the ith 
test in the second group. 


Rao (9) metthe problem of eliminating the cor- 
relation between variables by suggesting a method 
of transforming the original set of data to an or- 
thogonal set of variates. The most efficient meth- 
od of accomplishing this purpose is the application 
of pivotal condensation to the calculation of D*, 
and is the method used in this study. A system- 
atic and straight-forward illustration of the use of 
Rao’s pivotal condensation technique is found in a 
paper by Collier (2). 

The application of pivotal condensation to the 
problem of transforming correlated variates to a 
new orthogonal set reduces the covariance ele- 
ments in the Sij variance-covariance matrix to 
zero, thus eliminating the element of correlation 
from matrix calculations. The general distance 
function may then be expressed as 


D*di 


D-=— , 
*Sii (4) 


in which *Sjj, the variance of the transformed ith 
variable is found easily in the process of pivotal 
condensation, and each *dj? represents the con- 
tribution of the ith variable to the total D*. 


Significance of Added Variables 





Rao (7,8,9) has considered the following ap- 
proximate test of significance which can be used 
for testing both the significance of the total D? val- 
ue as well as of the addition to D* of each variable 
separately: 


_ (f-p-q+1) nyN9(D* p4q-D* p) 


(5) 
q {(ny+ng)+njn2D*p 





T 


where f = the degrees of freedom of the dis per- 
sion matrix of variance and covariances between 
the ith and jth characters 


p number of variables in Dé 

q = number of variables added to form Dj, 

n; = size of sample or groups from the first 
population considered 

Ny = size of sample from the second population 


T has the F-distribution with q and f-p-q+1 de- 
grees of freedom. Use of this value enables the 
research worker to determine the effect of adding 
variables to the matrix. 
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The Linear Discriminant Function (LDF) 





Let Xjjx be a typical observation where i = 1, 
2 stands for the sample, j for the trait measured, 
and k for the individual. Then the value Y, would 
be determined by the function 


Yik = +1%1 + \2X%2 +... + >pXp (6) 
and 

=_ l s 

Yik nj 2iYik (7) 
where 4j denotes summation over the ith sample 


(i= 1,2). The problem is to maximize the ratio 


Yo - ¥; 
¥E4(¥iK-Y¥1)* + D2(¥ox- Yo)" (8) 





which may be written as 


(2, i) 


ve 
\jm5jm 





eS 
j=1 m=l 
= 


where dj =— 


1 , x . 
ne 2X2jk - ny) Lik for the jth quan- 


tity (j = 1,2,...,p): and Sjm is the pooled sum of 
products of deviations from the sample traits j 
and m. Maximization of the above ratio requires 
essentially setting the derivative of the ratio equal 
to zero, and we obtain 


bP 


s 


z i mS 
m=1 jx) 9 ™ ™ 





d 
{ 7s 
p 
Poa \p\jr> fe ae 


Since the first factor in the above equation will be 
the same for all equations it can be ignored. 
Therefore, the coefficients required are obtained 
through the solutions of the normal equations 


(11) 


The diagonal cells in the above matrix represent 
variances, while all other cells consist of the ap- 
propriate covariances. D* is represented by the 
expression 





= . 
D * 2.4 


and the discriminant function by 


LDF -§ 4 Xj 
i=1 


Classification Procedures 





The application of Fisher’s discriminant func- 
tion to the problem of classification into one or 
two known multivariate normal populations is dis- 
cussed by Anderson (1). Suppose that an individ- 
ual I with a set of measurements Xj,... »Xq has 
been randomly drawn from one of two populations 
Ilj and IIl2, with probability distributions of the 
measurements f1(Xj,...,Xq), respectively. 
Suppose further that X1,..., has a joint normal 
distribution with means in Ij of EXj = uj (1), and 
in Ig of EX; = uj (2). Let the common setof var- 
iances and covariances be 

2 2 
Oy>- ++» pj O12> P13,++- Mp-1> p. 
It can then be shown that the best regions of clas- 
sification are of the general form 


1 2 
iz (uj, a —— (14) 


: (uj), a”) otegk (15) 


where »j,..., “p is the solution of 


(1) (2) 
lee by ios ai - Uj 


and log k = log L(1 17)2 


where L(2] 1) is defined as the loss incurred in 
classifying individual I into Ig when in fact he be- 
longs to IIj, and L(1 | 2) as the loss incurred in 
classifying individual I into lj when in fact he 
belong to II9. 

The first term of Formulas 14 and 15 is the 
same as that to which the LDF simplifies (see 
Formula 13) if the population expectations and co- 
variance matrix are known. Therefore if k= 1, 
i.e., if the probability of belonging to one group 
is equal to coming from the other, then log k = 0, 
and the procedure is reduced to a comparison of 
the LDF of individual observations with the LDF 
of the averages of the respective means. If a pri- 
ori probabilities are not specified, log k is found 
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so that the risk when I is from IIj is equal to the 
risk when I is from IIg. For this purpose let 


q q 
Ey Xi - FEA (uj) + uj) = UK, Kq) (17) 


Since a linear function of normally distributed 
variables is again normally distributed, the dis- 
tribution of U is likewise normal, with the mean 
and variances found as follows: 


E(U) = En E(Xj) - ore (uj (1) on™ (18) 


2 
= Bj (u;"? - a ) (19) 


2 
and = % 


(20) 


1 
Then if I is from Ij, E(Xj) = a , and 
q 1 
E(U) = 2¥ Aj (u;' ) ui =F (21) 


where a is the ‘‘distance’’ between II; and tad), or 
the variance of U. If lis from Ig, E(Xj) = uj 
and 


E(U) = 2 (uj | 4 (22) 


The probability of misclassifying an individ- 
ual belonging to IIj is then determined by refer- 
ring the following relation toa normal distribution 
table: 


(2 <a) (23) 
a 


The probability of misclassifying an individual 
belonging to Ilg is determined by referring the 
following relation to a normal distribution table: 


i 
P(z > leek i | (24) 
a 


In both cases k= IIg Ij. If Ilg= lj then k=1, 
and log k= 0. Then the formula for calculating 
the theoretical probability of misclassification is 
reduced to 


A 
P ( Z > 9~ee) (25) 
a 


The overall probability of misclassification is 
then the sum of products of the population propor- 
tions in each group and the respective misclassi- 
fication probabilities. 


Test of Effectiveness of Linear 





Discriminant Functions 





Lubin (5) has given us the tests of the effec- 
tiveness of the LDF inclassifying individuals cor- 
rectly. Ina two-contingency table let the columns 
Aj and Ag represent the observed categories while 
the rows Bj and B2 represent the predicted cate- 
gories. If perfect discrimination is achieved, only 
the diagonal ceils will be filled. For pure chance 
discrimination 


~ oe (26) 
N 


njj = frequency of the ith row and the jth 
column 

marginal total of the ith row 
marginal total of the jth row 

total number of individuals involved. 


Aj 
B; 
Nn? 


noeuiu 


In case the value of chi-square computed on the 
diagonal values is found to be significant (and a 
chi-square test is a necessary but not sufficient 
test to demonstrate classification beyond chance), 
then the one-tailed t-test for large values of N, 
may be used to demonstrate the effectiveness of 


the LDF: 
_ (o-eVN ‘ 
sg e(N - e) (27) 


where o denotes the number of individuals classi- 
fied correctly, and e the number of correct clas- 
sifications expected by chance. 


Computational Procedures 








In the main study the variance-covariance ma- 
trix representing the equation 

5 

Poth he ac a = uj). wj) Ps 3,..0,8 


was found to be as shown at top of next page. 

This set of simultaneous linear equations for 
Sample A was solved by the method of pivotal con- 
densation. This method reduces the original set 
of variables to a new orthogonal set of uncorre- 
lated variables, and makes possible a test of sig- 
nificance of the contribution of each variable to D*. 
The numerical values of the beta weights and the 
individual contribution of variables to D* are as 
shown in the middle of the next page. 

The value of D? com puted from the product of 
these weights and the difference in means for each 
variable was found to be .9173(i.e., the same that 
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F Hs Sc 


Mf Pd 





11.9238 4 + 1.9755 r2 


17.1353 \3 + 3.50744+ 8.4253>5= 1.5029 


1.9755 \y + 77.1411 \9 + 38.6356 3 + 19. 5887 4 + 30.8370 »5 = 2.2743 


17. 1353 1 + 38.6256 >» 


3. 5074 >; + 19.5887 » 


8. 4253 >; + 30.8370 


28. 5664 3 


50. 1619 »3 


104. 1166 3 + 28. 5664 4 + 50.1619 >5 = 4.9714 


72. 7225 4+ 16. 9858 5 = -3. 5029 


16. 9858 4 + 99.6932 »5 = 6.6629 





Contribution of 
Variables to D? 





. 1894** 
. 0534* 
. 0556* 
. 3750** 
. 2439** 


D? = .9173 





was obtained by adding the contributions of vari- 
ables to D* above). 
The first part of the expression 


is obviously the LDF, andfor any set of scores in 
Sample A, its numerical value will be written in 
terms of the beta weights for D*® as follows: 
p 
LDF=2 Xj 
i=l 


.0695X1 + .0116X2 + .0256X3 
- .0782X4 + .0578X5 


The second part of the expression represents 
the middle point between the sums of the weighted 
means of the two criterion groups and is therefore 
a constant quantity, c. For Sample A the value of 
c is the sum of the products of the beta weights 
and the sum of means, or 


41 (, 0695)(12. 309) + (. 0116)(103. 4514) + 
(. 0256)(119. 0628) + (-. 0782)(103. 6629) + 


(. 0578)(121. 6914)] = 2.0169 


Also, since 


2 (uj)_ u;)) 


q 
is identical with the expression eG it follows 
|= 


that oF - D*, or a = 0.9173, and a¥ = 0.9578. For 


the delinquent sample therefore, the mean of U 
turns out to be $a; for the non-delinquent sample 
the mean is - $a. The numerical value of these 
two statistics was found to be a= .4586 and -a = 
-.4586. 


Solution of the Minimum Probability 





of Misclassification 








By re-working Formulas 23 and 24 above, 
through the use of Formula 25, the probability of 
misclassifying an individual from either popula- 
tion is found to be .3161. Thus 
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TABLE IV 


A SUMMARY COMPARISON OF CLASSIFICATION EFFICIENCY FOR THE LDF PROCEDURES 
WITH CHANCE EXPECTATIONS FOR SAMPLES A" AND B' IN THE SCHOOL STUDY BY 
DELINQUENT AND NON-DELINQUENT GROUPS WITHIN THE SAMPLE 





Correct Classi- Correct Classi- Percentage 
fication by fication by Improvement 
Formula Chance Over Chance 


Samples Used N % % 





Sample A' - Delinquents N 91 60 65. . 50. 15. 9* 
Sample A' - Non-delinquents N 91 72 79. . 50. 29. 1* 
Total Sample A' N = 182 132 72.5 . 50. 22. 5* 


Sample B' - Delinquents N 91 63 . . , .2* 
Sample B' - Non-delinquents N= 91 63 . . , . 2* 
Total Sample B' N = 182 é ‘ . .2* 


All Groups Total N = 364 é ‘ : r . 9* 





*Significant at the 1 percent level 


TABLE V 


CLASSIFICATION OF DELINQUENTS BY LEVEL OF DELINQUENCY USING LDF PROCEDURES FOR 
TWO SAMPLES OF THE SCHOOL STUDY: SAMPLE A' ON WHICH PREDIC TION FORMULAE 
WERE DEVELOPED AND SAMPLE B' ON WHICH THE FORMULAE WERE APPLIED 
THROUGH CROSS-VALIDATION 





Sample A' Sample B' Total 
(N = 91) (N = 91) (N = 182) 
% N % % 





59. 62 65. 
75. 00 67. 
66. 67 7. 
83. 33 76. 
60. 00 57. 

. 00 80. 


Total y ° .o 67. 
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Py "> . 50 
k = po/py = 1.00 


loge k = 0.00 


and lj = Ila = (2 > 000 ee ) or . 3161 


which is equal to 31. 61 percent. 
This means, in effect, that the theoretical 
probability of correct classification for both de- 


linquents and non-delinquents is 68.39 percent. 


This overall percentage is essentially the same as 
that achieved by chance when the ratio of non-de- 
linquents to delinquent in the population is known 
to be 80.5 to 19.5, as it is inthis study. How- 
ever, Classification of delinquents by the LDF is 
more than three times as effective as chance pro- 
cedures, the formula identifying 68.4 percent of 
the delinquents as against 19.5 percent identified 
by chance. 

In actual practice, however, the population 
distribution is not known. In such a case, pure 
chance procedures make possible the identifica- 
tion of a maximum of 50.00 percent of each of the 
two populations. 


Empirical Cross-Validation of 
Samples A and B 








The actual effectiveness of the linear discrim- 
inant function was tested empirically by applying 
the formulas developed on Sample A to the delin- 
quent and non-delinquent sub-samples in both 
Samples A and B. A study of cross-validation re- 
sults as presented in Table II indicates that the 
overall percentage of correct classifications, 
namely 65.9, fell somewhat short of the predict- 
ed percentage of 68.4. The fact that the formula 
correctly identified only 58.9 percent of the non- 
delinquents in Sample B accounts for most of the 
shrinkage. However, 68.6percent of delinquents 
in Sample A, and 70.5 percent in Sample B were 
correctly identified. 

Classification results were also broken down 
in terms of delinquency level, as presented in 
Table Il. The tabulation shows generally in- 
creasing proportions of correct classification with 
level of delinquency, level 5being the only excep- 
tion. 


Results of the School Study 





The set of linear equations composing the var- 
iance-covariance matrix for the six variables 
used in the school study is presented at the top of 
the next page. 

The following is a list of the essential statis- 
tics derived: 


LDF = .0219X + .0282X9Q - .0430X3 + 
.0398X4 - .5247X5 + .0564X¢ 
2.7955 





a = 1.2387; Va = 1.1129; ¢a = . 6194 


Using Formula 25 the probability of misclassi- 
fying an individual from either population is found 
to be equal to 


0.0 - (-. 6194) 


= 02-p(2> “hw 





p (7 >. 5566) = . 2906 


or approximately 29.1 percent, or a probability 
of correct classification for both delinquents and 
non-delinquents of 70. 9 percent. 

Results of cross-validation procedures (see 
Table IV) indicate that classification by the LDF 
permitted the correct identification of 65.9 per- 
cent of the delinquents in Sample A' and of 69. 2 
percent in Sample B'. Corresponding figures for 
non-delinquents are 79.1 percent and 69.2 percent 
for Samples A' and B' respectively. In each case 
improvement over chance is significant at the one 
percent level. 

Although the two studies are not strictly com- 
parable, the additionof school record data raised 
the overall predictive efficiency of the LDF from 
65.9 percent in the main study to 70.9 percentin 
the school study, a difference significant at the 5 
percent level. 

Classification-results in terms of the level of 
delinquency are presented by means of Table V. 
Again, an increase inthe proportion of correct 
classifications corresponds closely with increase 
in level of delinquency, a marked drop in predic- 
tive efficiency at level 7 being the only exception 
in the general trend. 


Summary 


This study has demonstrated the usefulness of 
applying multivariate statistical techniques to the 
analysis of MMPI scale scores alone, or in com- 
bination with school data, for the purpose of clas- 
sifying ninth-grade boys as _ potential delinquents 
or non-delinquents. The techniques employed 
proved to be effective to the extent that 62.3 per- 
cent of the non-delinquents and 69.5 percent of the 
delinquents were correctly identified by the use of 
Multiphasic data alone, while acombination of Mul- 
tiphasic and school record data made possible the 
correct identification of 74.2 percent of non-delin- 
quent boys and 67.5 percent of delinquent-prone 
boys. 
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1. Introduction 





IT IS THE purpose of this note to indicate some 
formulas which are useful for finding the roots of 
real numbers on Friden Calculators. The meth- 
od used to derive the formulas is due to Newton 
although its application for taking advantage of the 
square-root feature on certain Friden Calculators 
is believed to be new. The mathematics of the 
derivations are given in the third section of the 
paper since this information is probably of least 
interest to those who will operate with the form- 
ulas. Thesecond section contains three examples. 
The cube root example is symbolically pro- 
grammed for the calculator in section 4. Ta- 
ble I presents a set of formulas which are effi- 
cient for finding the rth root of a real number. 
That is, for finding the square root, cube root, 
etc., of some number, N. The symbol x in the 
formulas stands for a guessed or trial value of the 
root and conceivably may be grossly in error. 
However, ifitis considerably in error, the form- 
ulas will ultimately provide an accurate estimate 
of the root although it will take a longer time than 
if an estimate only slightly in error were used. 

The method to use with the formulas involves 
essentially selecting atrial value, substituting 
this (as x) in the appropriate for mula and there- 
by obtaining a new trial value. This new value is 
again Substituted inthe formula (as x). This iter- 
ation process is repeated until the desired deci- 
mal accuracy is obtained. 

The formulas in Table I have been arranged so 
that the arithmetic ultimately involves the accu- 
mulation of two quotients. Therefore, the proper 
machine set-up requires the NON ENT key to be 
down. Alsoitis advisable that the ADD keyis in 
the down position. In some of the formulas the 
divisors of x appear as improper fractions (e. g., 
7" In actual computations, these numbers can 








be changed to decimals appropriate to the accu- 
racy required or the problem can be treated as a 
simple quotient (e.g., 3x/4). 

The Square Root Friden enables all roots of the 
form r = 2), where i = 1,2,3,..., of a number to 
be taken directly without the use of any special 
formulas. For instance, to obtain the fourth root 
of 2. 0000 we merely findthe V2 = 1. 41421 and then 
find V1. 41421 = 1.1892 which is an accurate esti- 


i 

mate of 24. The higher order roots are difficult 
to obtain on the standard Friden since the proced- 
ure requires the trial values to be raised to large 
powers. The Square Root Friden is generally 
more efficient for both the high and low order 
roots since the quantities involved in the quotients 
are easier to calculate and moreover the square- 
root formulas provide estimates which converge 
faster to the root desired. 


2. Examples 


a) Consider first the problem of taking the cube 
root of 27. The answer,of course, is 3, but 
nevertheless let us see how the method works on 
a problem for which we know the exact answer. 
Suppose for our example that we wish our answer 
to be accurate to four decimal places. We set the 
decimal on the keyboard, lower dial and upper 
dial to one more than this, 5, and the upper dial 
decimal is also set to2 x 5=10. Recall that the 
NON ENT and ADD keys are down. Since we will 
first use a standard Friden(or use the Square Root 
Friden in the standard manner) and we are seek - 
ing a cube root, the proper formula from Table I 
is: 


N 4 Xx 
3x? 1.5 


We first guess atrial value of x = 2.0000, say, 


*The opinions expressed are solely those of the author and are in no way official; nor are they tobe con- 
strued as representing those of the U.S. Naval Personnel Research Field Activity or Bureau of Naval 
Personnel. 
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TABLE I 


FORMULAS FOR FINDING ROOTS OF NUMBERS BY ITERATION 





Root Standard Friden Square Root 





2 (Square) 


3 (Cube) 


4 (Fourth) 


5 (Fifth) 


6 (Sixth) 


7 (Seventh) 


8 (Eighth) 


9 (Ninth) 


10 (Tenth) 





N = Number 
x Trial Value of Root 
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and copy this value on a worksheet. The number 
x is entered on the keyboard and also in the mul- 
tiplier dial. The MULT key is punched and the 
resulting square is rounded to 5 decimal places 
and recorded on the keyboard. Then 4.00000 is 
multiplied by 3 andthe product, 12. 00000 is 
entered onthekeyboard. The upper dial is cleared 
by hand. Next the number N = 27 is entered with 
the twirler knobs inthe upper dial around the 10th 
decimal place and the dividing keys are pressed. 
Now 2.25000 appears in the lower dial which 
should now belocked. The upper dial is clear but 
in general it will notbe, sothat the CARR CLEAR 
will have to be punched. We have now found the 
first of the two quotients. Next x = 2.00000 is en- 
tered around the 10th decimal place in the upper 
dial and 1.50000 (the constant divisor) is entered 
on the keyboard. After dividing, we find 3.58333 
in the lower dial. Thisnumber is our second ap- 
proximation. After copying this onour worksheet 
we clear the upper dial and lower dial, and unlock 
the lower dial, and proceed to square 3. 58333 
and start repeating the evaluation of the formula. 
After six trials the worksheet would appear as: 


—————aa 
1) 2.00000 
2) 3. 58333 
3) 3.08980 
4) 3.00257 
5) 2.99999 
6) 2.99999 


Because the fifth and sixth trials produce the 
same number, we cease iterating since a stable 
trial value has been determined. Therefore, the 
estimate of the cube root of 27 is 3.0000 which is 
accurate to four decimal places. 

b) The same problem will now be worked using 
the square-rootfeature. The appropriate formu- 
la from Table I, in this case, is: 


- 9 
1.5Vx 


We first calculate directly and record the VN = 
V27 = 5.19615 and V2 = 1.41421 which is the 
square-root of ourfirsttrial value. Next 1.41421 
is copied on the key board and multiplied by 1.5. 
This product, 2.12132, is copied on the keyboard. 
After clearing the upper dial by hand, 5.19615 is 
entered with the twirler knobs in the upper dial 
around the 19th decimal place. Division gives 
2.44948 in the lower dial which should now be 
locked. Clear the upper dial. Then enter x = 
2.00000 around the 10th place in the upper dial. 
Place 3.00000 (the constant divisor) on the key- 
board and divide. The accumulated quotient is 
3.11614 and this value is recorded on the work- 
sheet as this is our second trial value. Next we 
calculate v 3.11614 and repeat the process. The 


+ x 
3 





lower dial should be unlocked when starting a new 
iteration. The worksheet would appear as: 


x 


2. 00000 
3.11614 
3.00108 
3. 00000 
2. 99999 


VOT = 5.19615 


Notice that one less iteration is required for the 
same problem by this method than for the method 
used in sample a). 

c) As a final example consider the problem of 
finding the fifth root of 7 = 3.14159 = N, using the 
Square Root Friden. From Table! the formula 
to be used is: 


eS 


N4 
bo 
1. 25x* 


+~_- 


a 1 
The first task is to find N* = (3.14159)4. Since 
V3.14159 = 1.7724531, then Nt = V1.7794531 - 


1.33134. This number is recorded on the work- 
sheet. Because the fourth root of 7 is about 1.3, 
a good trial value for the fifth root is x = 1. 20000. 
The fourth root of x is now found and multiplied 
by 1.25. The resulting product, 1.30830, is re- 
corded on the keyboard and 1. 33134 is entered 
around the 10th decimal place in the upper dial. 
Division gives 1.01761 and the lower dial is locked. 
Clear the upper dial, enter 1.20000 in the upper 
dial around the 10th decimal place, and record 
5.00000 (the constant divisor) on the keyboard. 
Division produces 1.25761 in the lower dial and 
this value is the new x. Afterfour trials, the 
worksheet appears as: 


Trial x 
i 
1) 1.20000 (3.14159)4 = 1.33134 
2) 1. 25761 


3) 1. 25726 
4) 1. 25727 


Therefore, the fifth root of 7 is 1.2573 which is 
accurate to four decimal places. 


3. Mathematical Derivations 





Newton’s method for finding roots of equations 
is well-known and is described in great detail in 
many theories of equations andcalculus textbooks. 
Essentially the method says that the roots of an 
equation in x are approximated by the following 
expression:' 
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where f(x) is the function (or equation) of x, the 
f' (x) is its derivative. The functions of x used in 
this paper are very simple since we seek only the 
values of x satisfying the equation x‘ = N, where 
r is some positive integer. Of course f(x) = x" - 
N. We could consider more general functions but 
the result would be to introduce types of problems 
not frequently attempted. The derivative of f(x) 
is f' (x) = rx?-1 so that: 





f(x) + (x¥ - N) ; N . x 
f* (x) , rxr-l r/(r -1) 


rxr-l 


Upon substituting r = 2,3,...,10 in the expres- 
sion, we produce the formulas under the ‘‘Stand- 
ard Friden’’ heading of Table I. 

It occ urred to the author that it may be more 
efficient to utilize the square-root feature on cer- 
tain Friden Calculators to find the root of num- 
bers. Accordingly, f(x) was modified to be: 


i 
f(x) = x"/2 - Na 
Newton’s expression leads to: 


VN xX 

rx (P-2)/2 r/(r-2) 

2 
in this case. Substitution of r = 2 and 3 in the 
above expression gives the formulas for the square 
and cube roots in the second formula column of 
Table I. One could also use r = 4,5,..., to gen- 
erate iterative formulas for other roots, but it 





i 
appears more wise to use f(x) = x'/4 - N# for r = 
4,5, and 6. This was done and the expression is: 


5 x 


r/(4-4) 


For r = 7,8 and 9, one might consider f(x) = xl 8 


-NI1/8, 

The general principle being used is now clear. 
The Square Root Friden formulas are derived 
from Newton’s een y ~ utilizing a function of 
the form f(x) = x?/K - N1/K where k = 2,4, 8, 16, 

., Newton’s formula for the general case is: 


ni/k 


+ x 
rx (t-K) b T/(r-k) a * 
k 


To find the rth root of N, locate r between two 
successive k’s where k is an integral power of 2. 
In case r is equally distant between two k’s use 
the smaller k. Substitute k andr in Newton’s gen- 
eral formula above and solve for the specific iter- 





ation formula. Particular formulas aregivenin 
the third column of Table I. 


4. Programming a Desk Calculator 





The art of computation by means of desk calcu- 
lators has been taught primarily by the demon- 
stration of technique by the more experienced to 
the less. This unfortunate situation exists because 
it is usually much easier to show a student how to 
use a desk calculator than to tell him how, and ad- 
equate instructional materials arescarce. Calcu- 
lator manufacturers have not been too successful 
in producing manuals which have been useful to 
novices. Frequently manuals and books on desk 
calculators have attempted to illustrate computa- 
tions for particular problems. Consequently, it 
is often the reader’s unpleasant duty to attempt to 
adapt the principles or steps of the computation 
(if they can be discovered) to the spec ial task at 
hand. 

Some of the difficulty could be overcome if an 
efficient code were devised for the instructions. 
While the author was engaged in writing statistical 
programs for a digital computer, it occurred to 
him that instructions similar to those used by com- 
puters could be devised for desk calculators. An 
extensive list of instructions for Friden calcula- 
tors has been defined. Rather than list these in- 
structions here, suffice it to say that they refer 
to a) the set-up of the registers and keys of the 
calculator, b) arithmetic operations, c) the trans- 
fer of information, d) branching and decision mak- 
ing, and e) tallying and control. These desk cal- 
culator instructions could be given a mnemonic 
code (such as CAD for ‘‘Clear and Add’’) in the 
same manner as digital computer instructions, 
but this does not seem particularly necessary and 
would impose an additional learning task. The 
advantage of desk calculator instructions include 
the increased generality of computation illustra- 
tion, the compact description of computations, 
and a permanent and convenient record of proced- 
ures. The author believes that the actual demon- 
stration of technique remains the most efficient 
method of learning desk calculator computation 
but he also believes the programming of often 
used and important procedures bears serious con- 
sideration, particularly for instruction manuals, 
and statistical and computation laboratories. 

In order to illustrate how the programming 
might appear, aprogram for finding the cube root 
of N is provided in Table Il. The reader is invit- 
ed to followsimultaneously the verbal description 
in the first example in section 2 of this paper and 
the commands in Table II. Each command is giv- 
en a number called a Step Number and a Refer- 
ence. The Step Number provides the order in 
which the command is executed. The Reference 
refers to such things as a set of registers, a con- 
stant or variable, pair of numbers in registers, a 








"GQ UOTJONI}SUT 0} OH 
*x TelI} Mau ay} se © ul JaumMu sauTjeq 
*suotjemMores dois 
*J290YS ByEp UO 6 /N + X p10098y 
*sTewTDep p 0} © punol ‘renbes J 
*jenbaun aie paredwuos 
s1aquinu fl 9Z UOT}ONI}SUT 0} OH 
*sTeulloep p 
0} X }SE] YIM J9}SIZeI © aredwmo0g 
“MH pue y jo wuatjond purg 
*S *T 0} prvoghay jag 
*sTewloep ([+p)z YIM x 0} Y 39S 
*aZElIIED IdID 
*19}SIBZ91 JUITIONY yoo] 
"MX pue y jo quanonb pury 
*SIOTIIM} [VIP 9SQ  “STeuTDep 
(I+P)Z punore N 0} 1a}stT381 Y jag 
puvy Aq siajst301 © pue y Ie91D 
*prvoghsy 0} 19}SiZa1 Y 1ajsuery 
*STeulTep [+p 0} Y UT JONpoid punoy 
*¢ Aq Adu mw 
‘prvoqhay 0} 19}siZ81 Y Iajsuelry 
*STeEUITNEp [+p 0} Y UT JONpoid punoy 
‘sTewmloep [+p yim x Aq Adm 
*“STEUITIEp [+p YIM x 0} prvoghay jag 
*jaays eyeEp uO 
*x ‘yoor aqnd ayeutxoidde pr0os0y 
‘umop Avy ppy 39S 
‘umop Aay Arjuq UON 39S 
*sTeWIDap ([+p)zZ 0} 19}sI3e1 VY aS 
*STEWITDEp [+p 0} S19}STZeI Jag 
*UOT}IPUOD PIepUL}S UI JOJEMITeD Jag 


aac 


< 


N 
© 
ui 
Vv 
€ 
MI 
V 
x 
x 


* 


NMOGd 
NMOd 


V 
Ov NH 


HONVUd 
ANIA 
L'TVH 
quooru 
aNnouw 


TVNOANN ‘HONVU 


TAVdWOO 
AdIAIG 
quvodAda 
Las 

UVATO “WUVO 
M90'T 

AqCIAIG 


Las 
UVvaTto 
UAASNVaAL 
annou 
ATdLIL TAW 
YUaAASNVUL 
aNnnouw 
ATdLL TOW 
aquvodaArds 


quoodu 

adv 

AULNA NON 
TIVWIOdG 
IVWIOAIG 
AZIGUVGNV.LS 





syIeweYy 


aouaIa joy 


puewwoy 





YOLV TAOTVO NAGMd GUVGNVLS V 
NO NOLLVUSLI Ad N JO LOOYW AGND AHL ONLLNUWOO YOd WyudDoUud 


Tl ATaVL 





234 JOURNAL OF EXPERIMENTAL EDUCATION 


direction on how to set aregister or key, ora 
step number. The Reference is divided essential - 
ly into two parts. The first part contains such 
things as those items just mentioned, and the sec- 
ond part (separated from the first by a colon) al- 
ways specifies the decimal accuracy or decimal 
position if it is involved in the command. 

The commands have not been specifically de- 
fined. By reading the ‘‘Remarks’’ column, the 
reader should have little difficulty in determining 





the proper interpretation. Before the reader con- 
siders Table II, he should know that the symbols 
used are defined as: K = Keyboard, M = Multi- 
plier Dial, A= the Upper Dial or Register, Q = 
the Quotient or Lower Dial, d = thedecimal accu- 
racy desired, and x = atrial value of the cube 
root of N. The command ‘‘STANDARDIZE’”’ means 
that all the dials and registers of the calculator 
are unlocked and cleared, and all the operational 
keys are put in their usual (UP) position. 
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MUST ALL TESTS BE MULTI-FACTOR 
BATTERIES? 


EDWIN F. ROSINSKI 
Medical College of Virginia 
Richmond, Virginia 


MUCH OF the present day literature on meas- 
urement concerns itself with the application and 
use of multi-factor test batteries. This trend has 
become so pronounced that journals are publish- 
ing series of articles on the use of these batteries. 

In one of these series of articles, Super stated 
that if the detailed data on validity and norms in 
the Psychological Corporation’s Differential Ap- 
titude Tests and the United States Employment 
Service’s General Aptitude Test Battery are as 
adequate as claimed, then: 


....the use of singly developed and validat- 
ed aptitude tests in counseling will in due 
course be virtually a thing of the past: the 
multipotentiality of the multifactor test bat- 
tery gives too many advantages. 1* 


It appears as if Super’s statement is not so 
much a prophecy but a recognition of a reality that 
is already present. With a pronouncement such 
as this and others2 the ultimate question arises: 
Need single score tests be replaced by multi-fac- 
tor batteries? It appears, however, that before 
this question can be answered, there is a need 
first to investigate the relationship between these 
single score tests and mul ti - factor batteries to 
ascertain if eachis performing a different function 
or the same function. The Psychological Corpor- 
ation’s Differential Aptitude Tests (hereinafter 
referred to as the D. A. T.) and the World Book 
Company’s Otis Quick Scoring Mental Ability Test: 
Gamma Form (hereinafter referredtoas the Otis) 
were used to study this relationship. The deci- 
sion to use these two tests appeared practical for 
two reasons: (1) the D.A.T. as a multi-factor test 
is gaining in prominence and use and includes in 
the manual a great deal of standardization data, 
and (2) the Otis is one of the most commonly used 
tests providing a single score IQ. 


Procedure and Results 





A sample of 153 students of a senior high-school 
class was selected. From the permanent record 





* All footnotes will be found at end of article. 





cards of these senior students the appropriate 
D. A. T. scores andthe Otis IQ scores were taken. 
The former test had been administered to the stu- 
dents when they were inthe tenth grade and the 
latter when the students were in the twelfth grade. 
There is no appar ent reason to suspect that this 
recognizably selected sample should in any way 
contaminate the results. Using these raw data 
from the Otis andthe D. A. T. scores, means, 
standard deviations and Pearson product-moment 
correlations were obtained. 

After these correlations were found, it was nec- 
essary to determine partial r’s and partial stand- 
ard deviations.” Next the formula for Multiple R 
was used and a multiple correlation of .82 was 
computed. Not only does this prove to be signifi- 
cant at the .01 level of confidence but it is an ex- 
tremely high correlation. 

The next step involved the determination of the 
multiple regression equation or predicting the first 
variable (Otis) on the basis of the two known vari- 
ables (D. A. T. Verbal and Abstract Reasoning Raw 
Scores). With the calculation of the partial re- 
gression coefficients, values were substituted in 
Cornell’s# multiple regression or prediction equa- 
tion and the Otis IQ score might then be predicted. 
This equation reads: 


X, = Dias Xe + Dig.2Xy + a 


where X, is the predicted Otis score; b,2,3 and 
b, 3.2 the partial regression coefficients .51 and 
.89 respectively; X, the D.A.T. Verbal Reason- 
ing raw score; X; theD. A. T. Abstract Reasoning 
raw score; and ‘‘a’’ the constant value in the equa- 
tion determined to be 70. 5. 

Therefore, with the above equation and known 
values, a hypothetical examplecanbe tested. As- 
suming that student A has a D. A. T. Verbal and 
Abstract Reasoning scores of 29 and 34 respec- 
tively, what will his predicted Otis IQ be? 

Substituting in the formula we have: 


X, = bi2.sK2 + bis. 2X3 + a 
X, = .51(29) + .89(34) + 70.5 
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TABLE I 


ACTUAL AND PREDIC TED OTIS (GAMMA) IQ SCORES WITH 
DIFFERENCES BETWEEN THE TWO 





Actual Predicted Actual Predicted 
Student Otis IQ Otis IQ Difference Student Otis IQ Otis IQ Difference 





108 
104 

99 
112 
104 


104 104 
92 99 
115 108 
96 99 
106 104 
116 108 
101 102 
106 101 
109 108 
103 90 
123 129 
92 83 
110 91 
107 106 
111 108 
87 86 
106 121 
115 114 
97 100 
108 111 
109 110 
110 114 
110 

93 

100 

99 

101 

107 

115 

92 

116 

109 

117 

123 

100 

107 

100 

98 

121 

102 
110 
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0 
7 
7 
3 
2 
8 
1 
5 
1 
3 
6 
9 
9 
1 
3 
1 
5 
1 
3 
3 
1 
4 
9 
5 
3 
2 
4 
4 
0 
0 
4 
7 
2 
2 
8 
1 
3 
1 
6 
1 
6 
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TABLE Ill 


NUMBER AND PERCENT OF PREDIC TIONS MADE WITHIN CERTAIN 
DIFFERENCES BETWEEN ACTUAL AND PREDIC TED IQ’S 





Diff. Between 
Actual and Pre- No. of Percent of Cumulative 


dicted IQ Predictions Predictions Percentage 
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8 
8 
) 
8 
2 
5 
1 
1 
0 
3 
2 
l 
1 
0 
1 
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TABLE IV 


MEANS, STANDARD DEVIATIONS AND DIFFERENCES IN 
PREDIC TED AND ACTUAL IQ SCORES 








Actual Predicted Difference 





Mean 106.0 106. 2 0.2 


Standard 
Deviation 9.8 10.1 
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X, = 14.79 + 30.26 + 70.5 
X, = 115.55 


Rounded off to the nearest hundred, we have stu- 
dent A’s predicted Otis IQ as 116. 

Nevertheless, the task of predicting presented 
another problem. In attempting to predict sever- 
al Otis IQ’s it quickly became obvious that such a 
procedure of substituting in aformula was suitable 
with a small number of cases. On the other hand, 
if this was to be accomplished for a large group, 
it would be much too time consuming. Conse- 
quently, a prediction table was constructed by 
which the Otis score could be easily determined 
once the D. A. T. Verbal and Abstract Reasoning 
scores were known. Such a table was completed, 
but since itis large (49 by 47 columns) only a por- 
tion is here reproduced (Table I). 

To test the prediction equation or, as it devel- 
oped in this case, the predictiontable, a new sam- 
ple of students was selected. An entire senior 
class of 82 students in the same school where the 
original correlations were made was used. Again 
from the permanent record cards of these stu- 
dents the D. A. T. Verbal and Abstract Reasoning 
and the Otis scores were transcribedon a master 
work sheet. From the D.A.T. data the predicted 
IQ’s were made and compared with the actual IQ’s. 

While the task of developing and recording the 
figures for the prediction table was tedious, the 
effort was rewarded by the ease with which the 
predicted scores were made. 

Table II represents the findings of this proce- 
dure of predicting scores: the actual IQ scores 
along with the predicted IQ scores and the differ- 
ences between the two. Prior to reading the table 
it is important to remember that one standard er- 
ror of estimate for this prediction is 4.3 and the 
usual procedure is to expect, by the definition of 
a standard error of estimate, approximately 90% 
of the predictions within two standard errors of 
estimate, that is 90% of the predictions should fall 
within 8.6 (9) points of the predictedIQ. Table III 
shows the number and percent of predictions made 
within a certain difference. 

From these two tables itis obvious that the 
predictions are not all perfect. This might well 
be expected; for perfect correlations, while highly 
desirable, are extremely unlikely. Yet, it does 
appear significant when considering the number of 
predictions that fell within thetwo standard errors 
of estimate and the reliability of the Otis itself. In 
this case, the prediction table and/or prediction 
equation appear to have some value. 

Perhaps citing another figure might be more 
illustrative of the value of this prediction, for 
studies of such anature as this are primarily con- 
cerned with mean scores andmean values. There- 
fore, when comparing the mean IQ’s and standard 
deviations of the predicted and actual scores in 
Table IV, the accuracy of the predictions seems 





more vivid. 

The greatest value, perhaps, of this study lies 
in this table. Itappears almost extraordinary that 
predicting group mean IQ’s couldbe accomplished 
with such a degree of accuracy as illustrated here. 


Summary 


In correlating the D. A. T. Verbal and Abstract 
Reasoning subtests with the Otis, a multiple cor- 
relation coefficient of .82 was found. From the 
multiple correlation coefficient, the prediction 
equation was developed and predicted 90% of the 
cases within two standard errors of estimate. 
While all the predictions were not perfect, the 
Multiple R was almost identical to the Otis relia- 
bility coefficient. In addition, while differences 
in individual predictions were noted, the predict- 
ed mean group IQ’s were as close to the actual IQ’s 
as possible (106 and 106. 2). 


Conclusions 


From the data presented in this study, several 
conclusions seem appropriate. 


1. Because of the significant and high Multiple 
R (. 82) itappears as if the D.A.T. Verbal and Ab- 
stract Reasoning subtest are performing the same 
function as the Otis, i.e., measuring intelligence. 

2. Because of the significant and high Multiple 
R, it appears as if the D.A.T. Verbal and Abstract 
Reasoning subtests can predict Otis IQ scores. 

3. In actually predicting Otis IQ’s on the basis 
of D.A.T. Verbal and Abstract Reasoning subtests, 
approximately 90% of the predictions will fall with- 
in nine points. 

4. Predicting mean Otis IQ’s for groups can be 
accomplished with considerable accuracy and con- 
fidence. 


Implications 


Several implications manifest themselves as a 
result of this study. If the multi-factor tests 
measure the same factors as some single score 
tests, then it only seems logical that they might 
ultimately replace these single score tests, but by 
the same token if they are measuring the same 
factors as single score tests, why not continue us- 
ing these single score tests? It appears that even 
though the multi-factor tests can, with some de- 
gree of accuracy, dothesame task as single tests 
the rush toward these multi-factor tests should be 
a considered one until the predictive value of these 
tests is substantiated with proven data. This is 
especially pertinent when considering the amount 
of time that each test takes. The Otis requires 
thirty minutes testing time while the D. A. T. re- 
quires approximately three hours, not to mention 
the time required for directions and scoring. 
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An argument for multi-factor test batteries 
could be the possible multipotentiality of the multi- 
factor tests, i.e., that the various subtests could 
be recombined to predict other phenomena. 

What it seems to boil down to is this: that both 
tests tend to measure the same factors, but be- 
cause they measure the same factors it does not 


FOOTNOTES 


1. Donald E. Super, ‘‘The Use of Multifactor 
Test Batteries inGuidance,’’ Personnel and 
Guidance Journal, XXXV (September 1956), 
p. 14. 

2. 1955 Invitational Conference on Testing Prob- 
lems, sponsored by the Educational Testing 
Service of Princeton, New Jersey. 

Robert D. North, The Use of Multi- Factor Ap- 

















necessarily follow that one should replace the other. 
Test users perhaps should consider some gross 
criteria before deciding on a particular test: (1) 
the purpose behind using a particular test, (2) the 
time allotted to accomplish the particular purpose, 
and (3) what the testor and testee are most com- 
fortable in handling. 


titude Tests in School Counseling Proceed- 
ings of the 1955 Invitation Conference on 
Testing Problems, New York, October 29, 
1955, pp. 11-15. 

3. Any standard test on statistics in psychology 
and education provides formulae for these 
values. 

4. Francis G. Cornell, The Essentials of Educa- 
tional Statistics (New York: John Wiley and 
Sons, 1956), p. 320. 
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AN EVALUATION OF THE LABORATORY IN 
A COLLEGE GENERAL BIOLOGY COURSE 


DOUGLAS M. 
University of 


THE RESEARCH reported to date in the liter- 
ature indicates that no one type of laboratory is 
clearly superior indeveloping all of the outcomes 
desiredfrom the science laboratory. The accept- 
able science studies indicate, for example, that 
the conventional type of laboratory has proven su- 
perior to other laboratory methods in teaching 
laboratory skills, but not in promoting the acqui- 
Sition of facts. 

The research reported in this study was con- 
ducted to hel p determine the contribution of sev- 
eral types of biology laboratories in a college bi- 
ology course designed for freshmen and sopho- 
mores who do not plan to major in a science. 


Experimental Treatments 





The four experimental treatments compared 
were: (1) individual (conventional) laboratory, (2) 
demonstration laboratory, (3) workbook exer- 
cises, and (4) a biological term paper. 

The individual laboratory work was _ based on 
the laboratory work found in many of the published 
college laboratory manuals, and involved a week- 
ly two-hour laboratory period. The dem onstra- 
tion laboratory required one laboratory hour per 
week. The laboratory instructors had set up ex- 
periments and materials prior to the laboratory 
period for the students to observe andstudy. This 
demonstration type of laboratory eliminated many 
of the more routine aspects of the laboratory work 
for the student and allowed them to concentrate on 
the important points and principles. 

The workbook exercises were submitted week- 
ly and were based on questions in the demonstra- 
tion laboratory manual. The answers to these 
exercises had to be obtained from non-laboratory 
sources, suchas texts. The biological report 
was used as an experimental control. 

The lecture portion of the class was presented 
by one lecturer who kept the lectures as constant 
as possible for all sections of the class. 


Measuring Instruments 





Three tests were used to evaluate the achieve- 
ment of students random1y assigned to the four 
experimental treatments. The students took these 
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tests before beginning the course, upon comple- 
tion of the course, and three months after com- 
pleting the course. The three tests were tests of 
Biological Knowledge, Scientific Thinking, and Bi- 
ological Attitudes. These tests were validated and 
the reliability coefficients for the tests were com- 
puted. The reliability coefficients were computed 
by the split-halves method, using the maximum 
likelihood estimate of the correlation coefficient 
and then correcting this coefficient with the Spear- 
man-Brown formula. The corrected reliability 
coefficients were 0.87 for the Biological test, 0.86 
for the Biological Attitude test and 0.85 for the 
Scientific Thinking test. 


Experimental Population 





The students involved in the experiment were 
the 924 students who comp] eted the one quarter 
General College Human Biology course, G.C.10-A 
during the 1958-59 academic year. These stu- 
dents were enrolled in the General College, Home 
Economics, Education, Mortuary Science, and the 
College of Science, Literature, and the Arts. The 
students in the six sections of this course were 
randomly assigned in approximately equal num- 
bers to the treatments through the use of a table 
of random numbers. 


Experimental Design 





A stratified randomized design was used in this 
study, with the experimental treatments being 
randomly assigned over all of the experimental 
material. The students were stratified for col- 
lege and sex prior to being assigned to treatments 
as a means of more equally distributing the stu- 
dents of various abilities. 

The desirable statistical tool totest the null hy- 
potheses of the study was the analysis of variance. 
Of the two main models of this analysis, the fixed 
effects model, Model I, is designed for studies 
such as thepresent study. T-tests, Duncan’s Mul- 
tiple Range Test, and the Johnson-Neyman Tech- 
nique were also included in the statistical analyses. 


Experimental Results 





A comparison of the initial and final test scores 





“ 


by means of t-tests indicated that each of the four 
treatments had significantly increased student’s 
scores on the three tests. The treatment which 
most consistently inc reased the variance on the 
final tests was the individual laboratory, which 
suggests that this treatment allowed more ade- 
quately for individual differences in student aca- 
demic ability. 

The overall results from the experiment are 
presented in Tables I, I, and III which give the 
means of the tests given at the end of the experi- 
mental period for each of the six sections. 

When the final Biological Knowledge Test scores 
were analyzed by a two-way analysis of variance 
(treatment and ability level) none of the treatments 
was signifcantly better in increasing student 
scores. Table IV presents the analysis of vari- 
ance results for the lecture section which most 
closely approached significant results between 
treatments on this test. 

The conclusion which resulted from the analy- 
sis of the Biological Knowledge test was that none 
of the four treatments was significantly better in 
helping students learn the factual aspects of biol- 
ORY. 

The Scientific Thinking tests were similarly 
analyzed by the analysis of variance technique to 
determine which treatment functioned best in im- 
proving scientific thinking by students. The dif- 
ferences in achievement onthis test were not sta- 
tistically significant, indicating that none of the 
treatments was superior in improving scientific 
thinking by students. 

When the Biological Attitude Test results were 
analyzed, significant differences were found be- 
tween the treatment groups in two of the lecture 
sections. The significant differences in section 
one, Fall quarter (Table V) were further analyzed 
by Duncan’s Multiple Range Test, which indicated 
that the individual laboratory group of this section 
had significantly higher attitude test scores than 
the other three treatments. In section three, Fall 
quarter, the workbook group was significantly 
higher onthis test thanthe other three treatments. 
However, thetotal results do not indicate that any 
one treatment was consistently superior in im- 
proving student’s scores on this attitude test. 

The preceeding analysis of variance results al- 
so were used to determine if any treatment was 
superior so far as being more suitable for stu- 
dents of comparatively high, middle, or low aca- 
demic ability. The chosen criterion of academic 
ability was the student’s ACE scores. The analy- 
sis of variance results indicated that the students 
of a given ability group achieved equally well on 
the tests no matter whichtreatment they were as- 
signed to. Thus none of the treatments proved to 
be better for the students of any of these three 
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comparative levels of ability. 


Analysis of the Follow-up Tests 





Three months after completing the experiment- 
al course, arandom sample of students again took 
the tests used inthis study. Of the 122 students 
selected, 79.5 percent took the three tests. T-tests 
for correlated measures were used to compare 
the final and follow-up test means and variances. 

The students in all treatment groups tended to 
score lower on the follow-up tests of Biological 
Knowledge and Attitudes. However, each of the 
treatment groups increased their scores on the 
follow-up Scientific Thinking Test. In comparing 
the variances of the final and follow-up tests, the 
individual laboratory group most consistently in- 
creased the variability in scores on the follow-up 
tests, indicating that this treatment allowed more 
adequately for individual differences in learning. 

A two-way analysis of variance of the follow- 
up tests (treatments < number of additional biology 
courses taken by students) indicated that the four 
treatments were not significantly different in pro- 
moting a more permanent learning by students of 
biological attitudes, facts, or scientific thinking. 
This conclusion held both for those students who 
did or did not take an additional biology course 
after completing the experimental course. 


Johnson-Neyman Technique* 





While no significant differences were found be- 
tween the treatments on the Biological Knowledge 
and Scientific Thinking Tests, some differences 
were noted on the Biological Attitude Test. Thus 
it was advisable to collate the information from 
all of the six single experiments to see if the com- 
bined data indicated any significant differences be- 
tween the treatments. A qualitative modification 
of the Johnson-Neyman Technique was used to 
combine and collate the data from the six sepa- 
rate experiments. 

In this analysis, two treatments are compared 
at one time. For example, when the combined 
demonstration laboratory groups were com pared 
with the combined individual laboratory groups on 
the Attitude test, the difference (D) in mean final 
Attitude test scores between the two treatments 
and the variance (Sp) of D were computed. Then 
the value of tg was calculated, where 


Referring to the table oft, with the appropriate 
degrees of freedom, the null hypothesis of no mean 
differences bet ween the two groups could be re- 


* The technical background for this technique is presented in the text by Palmer O. Johnson and Robert Jack- 
son, Modern Statistical Methods: Descriptive and Inductive (Chicago: Rand McNally andCo., 1959), 427-8. 
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TABLE I 


MEAN SCORES OF STUDENTS ON THE FINAL BIOLOGICAL KNOWLEDGE TEST 





Lecture Section 





Sec. 3 Sec. 1 Sec. 2 
Treatment Fall Winter Winter 


Sec. 1 
Spring 





Demonstration Lab. 52. 92 50. 64 55. 50 
Workbook 55. 33 51. 29 53. 36 
Report 53. 67 49. 69 52. 60 


Individual Lab. 56. 42 46.13 52. 83 


48. 81 


53. 05 


50. 42 


50. 62 





TABLE II 


MEAN SCORES OF STUDENTS ON THE FINAL SCIENTIFIC THINKING TEST 





Lecture Section 





Sec. 3 Sec. 1 Sec. 2 
Treatment Fall Winter Winter 





Demonstration Lab. 28. 82 30.73 32. 97 
Workbook 32.33 32. 26 33. 41 
Report 29. 84 30. 00 32. 29 


Individual Lab. 29.75 28. 33 29. 50 





TABLE II 


MEAN SCORES OF STUDENTS ON THE FINAL BIOLOGICAL ATTITUDE TEST 





Lecture Section 





Sec. 3 Sec. 1 Sec. 2 
Treatment Fall Winter Winter 


Sec. 1 
Spring 





Demonstration Lab. 11. 82 13. 73 12. 81 


Workbook 14.15 13.76 13. 23 


Report : 13. 86 12. 38 14. 07 


Individual Lab. 12.75 12. 60 13. 22 


12.57 


13.19 


15. 46 


13.19 
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jected or accepted. 

Using this technique to compare each of the 
combined treatments with every other combined 
treatment, the conclusion was drawn that none of 
the mean test scores were significantly higher at 
the .05 level of significance. Thus, at this level, 
none of the treatments was significantly better in 
improving the mean final scores of students on 
these three tests. However, at the .10 level, the 
individual laboratory, when compared with the 
demonstration laboratory, was associated with a 
significant increase in scores on the Biological 
Attitude Test. A summary of the results of all of 
these comparisons is presented in Table VI. 

A sample of the experimental gr psof students 
was compared with two random samples of stu- 
dents who had completed this course several years 
ago. The ACE scores of these samples of stu- 
dents were compared by an analysis of variance 
technique. The experimental group was not sig- 
nificantly different, with regard to ACE scores, 
from the students of previous years. Thus the 
results of this experiment can be generalized to 
that population of students taking the experimental 
course, assuming that the characteristic of future 
students do not change. 


Summary 


Four experimental treatments (individual or 
conventional laboratory, demonstration laboratory, 
workbook exercises, and a biological term paper) 
were Statistically evaluated in a coll ege general 
biology course. Tests of Biological Knowledge, 
Scientific Thinking, and Biological Attitudes were 
used to compare the achievement of students ran- 
domly assigned to these treatments. 





A significant increase intest scores was asso- 
ciated with each treatment at the end of thein- 
structional period, but none of the treatments was 
superior to the other treatments in increasing 
student’s scores. Each of the treatments allowed 
for individual differences in the academic ability 
of students, but the individual laboratory allowed 
more adequately for these differences in learning 
ability and thus allowed the students to learn at a 
rate more consistent with their ability. 

In comparing the test results of students with 
comparatively high, middle, and low academic 
ability, none of the treatments was found to be 
more suited to students of a particular level of 
ability. 

At the .10 level, the Johnson-Neyman Tech- 
nique indicated a difference between two of the 
treatments. When the individual laboratory and 
demonstration laboratory groups were compared, 
the individual laboratory groups had significantly 
higher mean test scores onthe Biological Attitude 
Test. 

None of the treatments was significantly supe- 
rior in promoting a more permanent learning of 
the course materials, as measured by tests ad- 
ministered three months after the students had 
completed the experimental course. 

The results of experimentation can provide use- 
ful practical results. Based on the results of this 
experiment, a combination of these methods could 
be used to good advantage. The advantages of all 
four treatments could be incorporated into acourse, 
with particular emphasis being placed on the indi- 
vidual laboratory because of its better adjustment 
to individual differences in learning ability of stu- 
dents. In addition, if the four treatments were 
carefully integrated intoa course, student interest 
in the course materials could be increased. 
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APPLICATION OF MATRICES IN THE ANALY- 
SIS OF SOCIOMETRIC DATA 


JOHN X. JAMRICH 
Michigan State University 
East Lansing, Michigan 


A NUMBER OF research workers in the field 
of sociometry have clearly identified the inad- 
equacy of the ordinary methods used in the analy- 
sis of sociometric data. Briefly, the inadequacies 
stem from the fact that ordinary methods of anal- 
ysis do not determine, uniquely, status levels of 
the individuals or of sub-groups of individuals in 
the group being studied. Furthermore, present 
methods lack specific quantitative characteristics 
to permit analysis of change in the social structure 
of agroup, in the relative status of an individual 
within the group, orofthe sub-groups, usually re- 
ferred to as cliques. 

Katz* has presented a mathematical derivation 
of a ‘‘status vector’’ of agiven group. The deri- 
vation of the numerical elements of this vector is 
a relatively cumbersome process, involving the 
solution of ‘‘n’’ simultaneous equations, where ‘‘n’’ 
is the number of individuals comprising the group. 

It is the purpose of this paper to present the de- 
velopment of a relatively simple numerical char- 
acteristic for sociometric data which will permit: 


1. determination of the status of an individual 
within the group, taking into account his choices 
and the kinds of choices by others in which he 
is involved; 


. determination of a characteristic vector for the 
entire social group so that it may be compared 
with other groups and with sociometric data for 
the same group at a later date; 


. the identification of cliques, and their relative 
status within the group, andchanges which may 
be taking place from time to time as new soci- 
ometric data are gathered for the same group. 


The understanding of the development and com- 
putation is based onthe simple concept of addition 
and multiplication of matrices. By definition, the 
sum of two matrices: 








aj} aj2 
(1) My, = 
421 é b22 


a2 + ite 
ago + bg 


The product of two matrices, with elements as 
above, is defined as: 


is given by 

aj, + byy 
(2) My+Mo= 

ag; + bg 


411 by2 + 442 bee 


(3) 441 by1 + 412 bay 
Mj -M9= 


491 by, + 422 bgy «ag, byQ + agg bo2 


Thus, if we have a matrix 


(3a) 


011 


100 


(5) My+Mj= 


For later references, it should be pointed out 


*Leo Katz. ‘‘A New Status Index Derived from Sociometric Analysis,’’ Psychometrika, XVIII (March 


1953), pp. 39-43. 
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that the sums of the columns of the matrix My + 
Mj are 


(6) (5, 5,3). 


Consider, as an example, the following simple 
sociogram involving three individuals, Aj, Ag, 
and Ag, each individual having made one or two 
choices of equal ‘‘strength. ’’ 


A3 


A2 


The matrix representing this sociogram will be: 
A; A2 A3 
Aj 
Mj A? 
A3 


In the above matrix (8), the entries in the col- 
umns indicate that Aj is chosen by certain of the 
other individuals, andthe entries in the rows indi- 
cate that Aj has chosencertain of the other people 
in the group. In other words, the entries repre- 
sent the number of one-stage relations (or com- 
munications) between the individuals. 

The above matrix (8), is seen to be precisely 
the one used as an illustration in (3a). Now, let 
us multiply the matrix in (8) byitself. We obtain: 


Mj 


If we consider M, to be acom munication ma- 
trix, thenin the square of it, Mj, the entry in the 
ith row, jthcolumn gives the number of ‘‘2-stage’”’ 
communications between Aj and Aj. For example, 
A3 can communicate once with A] in two stages, 
while A2 cannot communicate with Aj intwo stages. 
We might say, then, that the entries represent the 
number of two-stage relations among the corres- 
ponding individuals. ‘ 

Now forming the sum, My + Mj, we obtain the 
matrix (10) shown at the top of column two on this 
page. 

From the interpretation of (8) and (9), we see 
that the entries in (10) represent the number of 





aa 
. The column sums of 
(10) M,+Mj,=| 11 1 (10) are (5, 5, 3) 
2s 2 9 


one- and two-stage relationships between corres- 
ponding individuals. Hence, the column sums will 
give the total number of one- and two-stage ‘‘in- 
ward’”’ relations (including the one with himself) 
enjoyed by each individual. This latter fact is 
used here as an indicator ofthe status of the indi- 
vidual within the group. (Therowsumscan be in- 
terpreted as ‘‘outward’’ one- and two-stage rela- 
tions or communications. ) 

Now, a consideration of the matrix in (10) and 
the original sociogram reveals some interesting 
facts. First, the diagonal of the matrix in (10) 
gives, for each of the ihdividuals, the number of 
‘*mutual choices’’ enjoyed by each one. Thus, Aj 
has two mutual choices, while Ag and Ag have only 
one each. As indicated above, the column sum 
vector (5,5,3), gives the relative status of each 
individual within the group. Thus, according to 
this characteristic vector, A, ahd A2 have equal 
status, while Ag has less than either of the others. 

Using this same group of three individuals, let 
us illustrate the manner in which the column-sum 
vector can be used to compare change in a given 
group from onetime to another. For this purpose, 
let us suppose that a subsequent application of the 
same procedure yielded the following sociogram: 





A3 
mn 


whose matrix is, 


~( 


The square of this matrix is 


2 
M, = ( 


and the sum, 





222 

2 
M,;+M,= | 2 2 2 
222 


The column-sum vector is (6,6,6). In other 
words, compared with the column-sum vector of 
the sociogram of (7), it is evident that all three of 
the individuals enjoy two mutual choices, as re- 
vealed by the diagonal values in(12), and, they are 
all of equal status. 

Remembering that these illustrations assume 
two choices for each individual, and two choices of 
equal strength, let us illustrate the utilization of 
the method with a more complex group, say, of eight 
individuals. First, let us consider the following 
sociogram for these eight: 


A 
e, 2 


Aj r 


Ag 


S. 


oa 








Ag 


The matrix for this sociogram would be 


oococoror oO 
ooroccoccor- 
orooococo 
orooocoror 
ke Or oococo 
Ke oOooroooo 
oococcorroo 
ooorcoro 


and the square of it is 


orr OOor ow 
-OoOOoOrFrFK OF © 
oooocorr oO 0 
oroonrrK oO 
Ke OOnocoorF oO 
-OoOF KF OOF CO 
onocorcr- 
re OoOnoocococr- 


The sum, C + C” is shown at the top of the next 
column, and the resulting column-sum vector is 


(7, 6, 3,8, 6, 6, 6, 6) 
(Ay Ag Ag Aq As Ag Az Ag) 


(16) V: 


From the values of the diagonal in (14), it is 
clear that Aj, A4, A5, and Aq each have two mu- 





orrF OF = 1 
- OF Fe FS Oe 
oroorr oO 
ON OCON NF 
Norn oOorF CO 
Norn OoOCrF © 
onoorN Ore 
-ONrFK OOK = 


tual choices to their credit. From the character- 
istic column-sum vector, it is seen that Aq has 
the ‘‘most’’ status, which fact can be substantiat- 
ed from the original sociogram, Sj, for the pur- 
poses of this discussion. Similarly, A is seen to 
have the next highest status designation, followed 
by A5, Ag, Az and Ag. The latter four are indi- 
cated by the vector sums as being of equal status, 
a fact which can again be checked in the original 
sociogram. 

It may be worth noting that the column-sum 
vector values of Aj and As are 7 and 6 respec- 
tively. From the sociogram, one would conclude 
that these two individuals have equal status on the 
basis of the fact that each enjoys two mutual 
choices. The values of 7 and 6 in the column-sum 
vector indicate the fact that Aj has one more two- 
stage ‘‘inward’’ relationinthe group than does A5, 
a fact which may be of some importance in the 
inter-group relationships. 

The interpretation of the column sums might 
be illustrated at this point by showing the meaning 
of the number ‘‘7’’ which is the value associated 
with Aj. Theseven ‘‘inward’’ one- and two-stage 
relations for Aj are as follows: 


Aj+—— Age Ag, 
Aj«e——  Age—— A 
Ay«—— Age A 
Aje——. Age A 
Aje—— _ Age——_ A7 
Aje—— A 

Aj«—— Aq 


By using the characteristic column-sum vector 
to identify the individuals with the most status, 
and using the original matrix, we are able to ident- 
ify the cliques in this group and associate with 
them astatus designation by using the column- 
sum vector for each individual in the clique. 

For example, from (16) we see that Aq has the 
highest column sum. From the diagonal entry in 
(14) we see that Ag enjoys two mutual choices 
which, from (13) are with A; and A7. Hence, Aq, 
Aj, and A7 can be said to form a clique. 

Similar analyses yield the following cliques, 
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the first three of which actually constitute a large and 
sub-group 

Aq Ay 

Ag Ag Aq 


Aj Ag A4 
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As Ag 
The characteristic column-vector for (20) is 
Now consider the same eight individuals with 


the following sociogram at some later time. (21) eS % Be eee SS 


(Ay, Ag, Ag, Aq, A5, Ag, Az, Ag) 


A 
1 e——> A2 and the cliques, with their characteristic column- 


LZ sum vectors, are 

Ag A; Ao Ag: (6, 7, 3) 

Ae Ag Ac: (10, 8, 8 

A3 3 Ag A5: | 
Al N It is evident that the individual with the most 
a i-7 4 status is A3, quite a change from the case in (13). 

5 
Ag 


The status of the other individuals can be read 
quite easily from (21). There are two strong 
cliques, the stronger of them being Ag, Aq, A5. 


The matrix representing this sociogram is 


Summary 


The use of the characteristic column-vector 
presented above appears toprovide a useful meth- 
od for quantifying sociometric data. Specifically, 
it affords a relatively simple procedure for de- 
termining a numerical value for the status of in- 
dividuals in a group. 

It should be emphasized, however, that the 
method presented above utilizes the notion of the 
incoming, one- and two-stage relations enjoyed 
by each individual in the group. This procedure, 
then, places additional emphasis not only upon how 
many chose a given individual, but whether those 
who chose him were chosen by others in the group. 

Finally, the utility of the method of the paper 
will depend upon additional research with socio- 
metric data to ascertain the significance of the 
two-stage relationships in interpersonal choices 
of individuals. 
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A TABLE TO FACILITATE THE CALCULATION 
OF NORMALIZED T-SCORES FOR SMALL 
SAMPLES (N>5<51) 


PETER F. MERENDA 
Walter V. Clarke Associates, Inc. 
East Providence, Rhode Island 


SEVERAL AUTHORS of psychological meas - 
urement and statistics texts, Garrett (1) and Ed- 
wards (2), for example, have included in the ap- 
pendix atable to facilitate the calculation of 
T-scores. The use of this table, however, re- 
quires the performing of all of the steps in the us- 
ual procedure for normalizing distributions and 
calculating T-scores. 

The table presented in this article requires 
only that the first four steps in the usual procedure 
be performed. The final step of calculating per- 





centages, whichis the most time-consuming of all 
the steps, before entering the T-score table is 
eliminated. In order to enter this table, all one 
needs to know are the number in the sample and 
cumulative frequency below a given score plus one 
half of the frequency of the given score. The ta- 
ble is divided into two parts: Table I for N’s be- 
tween 6 and 24 inclusive, and Table II for N’s of 
25-50 inclusive. The procedure and instructions 
for use with this table are given below with ex- 
ample: 


INSTRUC TIONS FOR USE WITH T-SCORE TABLE 


Step Procedure 


1. Prepare frequency distribution of scores. 


2. Cumulate scores, i.e., add each successive row of scores. 


. Tabulate the frequency below a score + 1/2 the frequency of a given score. 


. Enter T-score table with N (number in sample) and read standard normalized 
scores in the appropriate cell for values calculated in step 3. 


Example: 


N = 10 





Cum.f. below 
+1/2 f 


T-scores (Read 
from chart) 





9.50 
8.00 
5. 50 
3.00 
1.50 

. 50 
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TABLE I 


A TABLE FOR CALCULATING T-SCORES 
(N > 5 < 25) 
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This table should prove to be of value tocol- 
lege instructors and classroom teachers who de- 
sire to normalize distributions of scores of 
classroom tests and report scores as normalized 
T-scores. This table may be extended indefinite- 
ly, but it is felt that the maximum N of 50 will 
suffice for the great majority of classroom situa- 
tions. 
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ESTIMATION OF TEST RELIABILITY BY 
ANALYSIS OF VARIANCE TECHNIQUE’ 


CYRIL J. HOYT and P. R. KRISHNAIAH 
University of Minnesota 


Introduction 


IN AN EARLIER paper, Hoyt! showed a method 
of determining the best linear estimate of the er- 
ror variance appropriate for expressing an inter- 
nal consistency reliability coefficient for a total 
test score obtained by adding the scores on items 
scored either one or zero. In this derivation it 
was assumed that the pq errors of measurement 
in the responses of p students to eachof the q items 
were independent. Later investigation showed 
that the procedure described in this earlier paper 
is entirely appropriate when: 


1. items are scored in more than two categor- 
ies“, 
2. ‘‘items’’ are ratings, test scores or other 


measures», 


Further theoretical development in the area of 
components of variance estimation has enabled the 
authors to showthat certain assumptions regard- 
ing the independence of the errors of measurement 
and of independence of main effects (individuals 
and items) are not necessary in order that Hoyt’s 
1941 formulation of the reliability coefficient es- 
timate yield appropriate ratios of expected values 
of the mean squares. 

When item effects are assume to be intercor- 
related (see models 5 and 6) tie expected values 
of the two variance components (A) and (C) used in 
estimating the reliability coefficient are precisely 
equal to those obtained for models 1, 2, 3, and 4 
in which independence is assumed as in the 1941 
derivation. 

When errors of measurement are assumed to 
be correlated for the responses of a given individ- 
ual on different items (see models 7, 8, and 9) the 
expected values of both variance components A 
(among individuals) and C (error) are different 
from those obtained for the first six models. How- 
ever, the ratio of (A-C) to A still provides a suit- 
able estimate of the internal consistency reliabil- 
ity coefficients provided the user recognizes what 
he has. 





*All footnotes will be found at end of article. 


The definition of fixed, mixed and random mod- 
els follows: 

The model equation for each of the nine lines of 
Table lis Xjj= m+ ai + Aj + €ij where 


i= 
j= 


where p, j, 8; are the general effect, ith indi- 
vidual effect, and jth item effect. € ij'S areerrors 
which are normally distributed. This equation is 
suitable for all three types of models with differ- 
ent assumptions on the cc’sand ~’s. We may sup- 
pose that the individuals and the items are drawn 
randomly from their corresponding populations 
which contain P individuals and Q items respec- 
tively. 


A. Fixed Model 





When P = p, and Q = q, then the above equation 
represents a fixed model. In this case, the only 
random element is the error term €jj. When de- 
scribing the reliability coefficient for a given test 
form administered to a given group of individuals 
the fixed model is appropriate. 


B. Mixed Model 





When P = p and Q > q or when P > p and Q= 
q, then the equation represents a mixed model. 
(The symbol > is used to designate the relation 
as much greater.) This model is suitable for con- 
sidering the reliability obtained as an indicator of 
that appropriate for either measuring a given 
group of people on a domain of items represented 
by the sample of q items or, alternatively, for 
generalizing to a population of P subjects for any 
given administration of a certain test of q items. 


C. Random Model 








When P > p and Q > gq then the above equation 
represents a ! ‘om model. This model is suit- 
able for esti reliability coefficient for 
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TABLE I 


EXPEC TED VALUES OF MEAN SQUARES FOR NINE MODELS 





Model (A) Among Individuals (B) Among Items (C) Error 





Fixed model 
Individual item effects fixed A. Zz a” + + D4, 
Errors uncorrelated a ; 





Mixed model 

Errors uncorrelated 
Item effect random 
Individual effects fixed 





Mixed model 

Errors uncorrelated 
Item effects fixed 
Individual effects random 





Random model 

Errors uncorrelated 
Item effects random 
Individual effects random 





Random model 

Errors uncorrelated 
Item effects correlated 
Individual effects random 





Mixed model 

Errors uncorrelated 
Item effects correlated 
Individual effects fixed 


o* + p(l - p) oF 





Fixed model 


Errors correlated 
on different items for - sa; o*(1 - p) + nal =p; o*(1 - ~) 
same individual p- q- J 





Mixed model 

Errors correlated as in 7 

Individual effects fixed - " o*(1 - 0) + pos 
Item effects random P ’ 





Random model 

Errors correlated as in 7 

Item effects random o* | 1+(q-1)p] + qoy o*(1 - o) + pop 
Individual effects random ; 
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an infinite population of subjects or a finite one of 
size P, on the whole domain of Q items. 

Columns 2, 3, and 4of Table I show the expect- 
ed values of the mean squares components in nine 
different models that are each specified by the as- 
sumed characteristics listedin column one. These 
nine models are all useful for specifying internal 
consistency reliability coefficients if Xjj is con- 
sidered the score earned by i - th individual’s re- 
sponse to the j - th item on an examination. In 
these nine models the interaction term is used as 
error as is uSually done in cases providing only 
one observation percell. By making different as- 
sumptions about the parameters in the equation the 
nine different models each described in succes- 
sive lines of Table I arise. There are a number 
of other models which seem useful and for which 
we have not yet studied the variance components 
and reliability estimates. 

The first six entries in the last column of Ta- 
ble I are the expected values of the mean squares 
of the true scores or the numerator of the relia- 
bility coefficient appropriate for each line of the 
table. The denominators of the reliability coeffi- 
cients in every case are the entries in column A. 
In the models specified in the last threelines, the 
entries in the last column seem to contain more 
than the variance of the true scores. Hence, 
though these could be defined as the numerators of 
the reliability coefficients, they do not appear to 
be appropriate estimates for the variance of the 
true scores. 

Let us consider now the first four models in 
which all the effects represented by the terms of 
the equation are independent. In these cases, one 
notes that the usual procedure for estimating the 
internal consistency reliability coefficient gives 
adequate estimates of the ratio of the variance of 
the true scores to the variance of the obtained 
scores. The last column of Table I gives only the 
numerator of the reliability coefficient obtained by 
taking one minus the ratio of the error mean square 
to the among students’ mean square. The denom- 
inator of the reliability coefficient is (A) of the 
corresponding row or the mean square for among 
individuals. Rows 5 and 6 show expected values 
for random and mixed models, respectively, in 
which the item effects are correlated with one an- 





other to the same extent () for each pair of items. 
The column indicating the expected value of the 
‘‘among items’’ mean square gives an expression 
involving this common correlation / between items 
However, as long as the errors of measurement 
in the several items are considered independent of 
each other and independent of the item and indi- 
vidual effects, the usual mean square ratio for 
estimating the coefficient of reliability gives a nu- 
merator which is an appropriate estimate of the 
variance of the true scores. 

In models 7, 8, and 9, however, the numera- 
tor of the usual estimate of the reliability coeffi- 
cient contains a term oqo? which is not zero un- 
less © is zero. Inthese three models, it is as~ 
sumed that the errors of measurement on pairs of 
items are correlated with a common co-variance 
for each individual. 


FOOTNOTES 
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the Bureau of Educational Research, Univer- 
sity of Minnesota. 


. Cyril J. Hoyt, ‘‘Test Reliability Estimated by 
Analysis of Variance,’’ Psychometrika, VI 
(1941), pp. 153-60. 





. Cyril J. Hoyt and Clayton L. Stunkard, ‘‘Esti- 
mation of Test Reliability for Unrestricted 


Item Scoring Methods, ’’ Educational and Psy- 
chological Measurement, All (1951), pp. 


196-58. 





- Robert L. Ebel, ‘‘Estimation of the Reliability 
of Ratings,’’ Psychometrika, XVI (1951), 
pp. 407-24. 





. P. Horst, ‘‘A Generalized Expression for the 
Reliability of Measures,’’ Psychometrika, 
XIV (1949), pp. 21-31. 





. Howard W. Alexander, ‘‘The Estimation of 
Reliability When Several Trials are Avail- 
able,’’ Psychometrika, XII (1947), pp. 79-99. 











JOURNAL OF EXPERIMENTAL EDUCATION 
(Volume 28, Number 3, March 1960) 


ON TEACHING F INSTEAD OF t 


NORMAN H. ANDERSON 
Yale University* 


IT IS WELL known that the square of at ratio 
on n df is interpretable as an F ratioon1l andn df. 
For the case of oneor two means then, the values 
of t and F are equivalents, and lead to identical 
probability and inferential statements. Since F is 
the more general, applying as well to the case of 
more than two means, one might well suspect that 
the t test could be retired from use. Because the 
t test constitutes sucha large part of current sta- 
tistical pedagogy and practice, it may be useful to 
consider the matter in more detail. The main 
considerations which seem tothe writer to be rel- 
evant are summarized briefly below. ! These con- 
siderations are applicable principally to the first 
few statistics courses in the applied sciences and 
have less relevance to more theoretical courses. 

In favor of t, it canbe argued that it has great- 
er intuitive appeal. To measure the difference 
between two means against its standard error may 
be simplerto comprehend and result in easier 
learning than does the variance partitioning tech- 
nique of F. However, as soon as it is admitted 
that t is insufficient to meet everyday needs, so 
that F must be taught anyway, this argument loses 
its power. It may also be suggested that whatever 
greater immediacy of meaning may attachto t is 
not entirely its natural due, but rather a product 
of the present organization of statistics courses. 

The t test may also be preferred because of its 
relation to some of the problems arising in esti- 
mation, such as the construction of confidence in- 
tervals. Algebraically, of course, it makes little 
difference whether one proceeds fromt orfrom F. 
Thus, with F, the 95 percent confidence interval 
is obtained using Vs; F(.95;1,n). The interpre- 
tation of the confidence interval could then be tak- 
en on faith (and even serve usefully as the basis 
for the significance test for the case of one or two 
means). However, it must be admitted that the 
confidence interval idea seems easier to compre- 
hend when it is based on the standard picture of 
the symmetrical t distribution. This possible ad- 
vantage of t must be measured against its short- 
comings in problems of significance testing which 
are discussed below. 

A further point in favor of t stemsfrom a dan- 








*Footnotes will be found at end of article. 





ger of the analysis of variance. Ordinarily, when 
pair-wise comparisons among several conditions 
are to be made, the error mean square from the 
overall F analysis is used throughout. When the 
condition variances are reasonably equal, this 
procedure gives increased sensitivity since the 
error is estimated more precisely. If there is 
marked heterogeneity of variance, however, such 
tests may be biased, and it may be preferable to 
test each pair of conditions using only the data 
from that pair (although this does not entirely es- 
cape the difficulty). Of course, the two condition 
F could be used here as easily ast. Neverthe- 
less, it must be admitted that routine application 
of the analysis of variance does facilitate over- 
looking the problem. 

Against these arguments must be set a number 
of negative aspects of t. For situations with just 
one or two conditions, of course, it is a matter of 
statistical indifference whether tor F is used. 
Either yields the same inferences, as noted above. 
Moreover, since t?= F, there need be little differ- 
ence in computational efficiency. It may be noted, 
however, that the standard computing rules for t, 
in contrast to those for F, are not efficient and 
are thus more liable to arithmetical error. 

When more than two conditions are involved 
piece-meal t testing presents certain dange rs.2 
The one commonly em phas ized is the increased 
risk of thereby obtaining ‘‘significant’’ differences 
which the data do not in fact support. Curiously 
enough, the t test approach also increases the 
likelihood of the obverseerror, namely, failing to 
detect results which the data are quite adequate to 
declare real. This can happen in a number of 
ways, which trend tests may serve to illustrate. 
In investigations in which one of the independent 
variables is numerical, the between conditions 
variation can often be expected on prior grounds 
to reside largely in the linéar, or linear + quad- 
ratic, component of the curve. Pair-wise com- 
parisons of means may fail to reveal even a fairly 
strong real effect which a trend analysis F wili 
elucidate with ease. 

A more fundamental objection to the t test ap- 
proach concerns the original planning of the inves- 
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tigation. The researcher who thinks in terms of 
t is less likely to use as incisive and informative 
a design as can be had for the allotted time and ef- 
fort. Even when the basic plan is sound, a poor 
superstructure can, and not infrequently does, 
prejudice the analysis and interpretation, or en- 
tail considerable extra computational cost. 

Fisher’s commendation of the use of factorial 
designs for controlling extraneous variation may 
be cited in illustration of this last point. Even 
with only two main experimental conditions, it is 
often desirable to counterbalance a second vari- 
able, not of great interest in itself, in order to 
obtain added generality. Unless set up so that the 
two-way analysis canbe and is used, however, the 
counterbalancing may do more harm than good. 

In a similar way, it is often possible to use as 
a second independent variable some individual dif- 
ference measure which is correlated with the re- 
sponse measure under study. This procedure, 
essentially a ‘‘randomized bl ocks’’ factorial de- 
sign, may generally be expected not only to in- 
crease the sensitivity of the experiment, but also 
to yield information on the organismic variables. 

That the F test is a desirable piece of statis- 
tical machinery hardly needs even the incidental 
justification given here. But if F is to be taught 
at all, then it evidently follows, since Fis the 
more general, that F alone should be taught. To 
require the mastery of two sets of rules where 
one will do is an improvident use of the student’s 
time. 

For statistics courses in which time is insuf- 
ficient for both tests, the choice of F would ac- 
complish the same ends asthechoiceoft. If more 
time were available, the elimination of t would 
permit the development of some of the simpler 
ramifications of the analysis of variance, such as 
two-way designs, specific comparisons, and trend 
tests. Moreover, teaching Ffrom the start would 
set up conditions for maximal positive transfer 
later in learning since the computing rules for F 
follow auniform and elegant logic in all situations. 
It might also be noted in this connection that the 
computational procedures for correlation and re- 
gression analysis can be subsumed under the anal- 
ysis of variance format as well. 

Finally, it may be questioned whether learning 
the t test does not produce a fair amount of nega- 
tive transfer. The evidence available to the writ- 
er, admittedly anecdotal in nature, suggests that 
students who have learned t are reluctant to aban- 
don this skill when necessary. Havinga technique 
which may actually, if not always advisedly, be 
used, they tend to regard even workaday designs 
as ‘‘fancy’’ or ‘‘not worththe trouble at this stage 
of investigation.’’ Thus, if they use the design at 
all, they do so only with pain, and so fail to ac- 
quire the familiarity with the new technique that 
would come were it regarded as just another ap- 
plication of the F test they already knew. 





It would thus appear that t is superfluous, since 
F may always be computed instead; inadequate, 
since it does not apply as generally as does F; 
dangerous, since its use may increase errors of 
inference, as well as lead to weak experiments; 
and a waste of time, since teaching F will provide 
more transfer to later learning. On the balance, 
it seems fair to conclude that the t test should be 
struck from the curriculum. 


More practical experience is neededin order to 
integrate the analysis of variance with beginning 
statistics on the conceptual level. However, it is 
appropriate to sketch briefly the sequence of com- 
putational routines which is indicated by the argu- 
ment of this note. 

1. Computation of the sample variance is done 
using the analysis of variance table: 


Source df SS MS 





Mean 1 T?/N 


Error 


(Variability) N-1 © X*-T?/N 





The entry, MSerror, then furnishes an unbiased 
estimate of the population variance. The scaffold- 
ing provided by the tabular format is unnecessary, 
of course, but seems to serve a useful mnemonic 
function. Its real benefit accrues in Step 2. 

2. The test of significance for a single sample 
is most appropriately carried out on difference 
scores, or on other scores which have a mean of 
0 under the null hypothesis. Asingle division ap- 
pended to the Step 1 routine yields the relevant F 
ratio. 

3. With two samples of difference scores, the 
analysis of variance table is extended one step 
further: 


Source df Ss MS 





Mean 1 


Between 
Groups 1 


Error N-2 





4. It is straightforward to extend Step 3 to the 
case of several samples and it seems advisable to 
do so promptly in order to avoid the development 
of a superstition that the technique only works for 
one or two. The routine remains the same when 
the scores are arbitrary, rather than difference 
scores. Itis only necessary to observe that the 
F for Mean is no longer meaningful or important 
and may be omitted from the table. 
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5. A simple extension of the computing routine 
yields the calculation of r for a single sample: 


Source xXx XY YY 





¥ 2 / 
Mean Ty/N TyTy/N Ty/N 


Error 


=X*-T,/N =EXY-TyTy/N LY*-TY/N 





The square of the correlation coefficient is esti- 
mated from the entries in the Error line, r? = 
SSyy ‘SSyxSSyy- The test for significance may 
be made by consulting a table for r or, with an 
additional calculation, by using F. When the cal- 
culation of r is taught this way, there is less like- 
lihood of confusedly agglomerating the between 
and within samples regressions when more than a 
single sample is involved. 

It is suggested that >X* be given for most of 
the homework problems. This will greatly de- 
crease the time spent on each problem and, by 
thus allowing a greater variety of problems to be 
assigned, increase the learning efficiency. Wheth- 
er or not further applications of the analysis of 





variance are given in the first course will depend 
on the available time, and on whether the course 
is oriented toward experimental or observational 
research. 


FOOTNOTES 
* Nowat the University of California, Los Angeles. 


1. For the purposes of this note, the terms ‘‘F 
test’’ and ‘‘analysis of variance’’ will be taken 
as synonymous although this is not strictly cor- 
rect. It will be necessary to allude to a num- 
ber of uses of the analysis of variance, but 
these uses are well known and will not be doc- 
umented. 


. Aspecific comparison based on some reason- 
ably firm rational or empirical ground may al- 
ways be tested, of course. However, unless 
it is based on a single degree of freedom, it 
will require an F ratio in its own right. It 
should also be specifically noted that multiple 
comparison procedures will be superior to F 
(and t) in certain situations. 
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COMPUTATION OF VARIANCE ACCOUNTED 
FOR IN MULTIPLE CORRELATION 


CLINTON I. CHASE 
Idaho State College 
Pocatello, Idaho 


WHEN MULTIPLE correlation techniques are 
used in the analysis of data, the investigator usu- 
ally wishes not only to discover a combination of 
variables which will yield maximum prediction of 
a criterion, but also he wishes to determine the 
amount of variance in the criterion which is asso- 
ciated with each of the independent variables. For 
this purpose Engelhart(1) has submitted the 
formula: 


2 2 2 2 
R* = Bo. + Bo2 +--+Bon + 2Bo:Bo2Ti2 + 


2Bo:Bosis +--+ 280,n-18onTn-15n [1] 


Following this procedure, the ith beta squared will 
produce the variance directly associated with the 
ith variable. But since the ith variable is usually 
correlated with other independent variables, it al- 
so enters into variance which is shared by several 
of these other independent variables. The total 
amount of this shared variance is given by the 
summation of all of the terms of the form 2Boi8oj 
rjj which appear in formula[1]. Then, according 
to Engelhart’s procedure, the shared variance is 
divided among each of the independent variables 
in the same proportions as the direct variance. 
The total amount of variance associated with the 
ith variable, then, would be the ith beta squared, 
plusa proportional amount of the total shared 
variance. 

Since Engelhart’s procedure for partitioning 
the shared variance is admittedly arbitrary, the 
purpose of this paper is to present an alternate 
procedure, which, itis believed, is somewhat less 
arbitrary in its method of apportioning shared 
variance, since here the basis for partitioning the 
shared variance is found in mathematical manip- 
ulations. 

As a necessary partofthis presentation a sec- 
ond technique (2:409-10) for computing the vari- 
ance inthecriterion associated with any given in- 
dependent variable must be noted. Such a tech- 
nique is provided in the formula: 








S f 
R* = Boilo. + Bo2Toz + BosTos +. -BonTon 


2] 


Using this procedure, one may find the total 
amount of variance associated with the ith vari- 
able in the product Soiroi- Although form ula[ 2] 
is simpler to use than formula[1], especially when 
the number of independent variables is more than 
two or three, it has the disadvantage of not yield- 
ing a break-down bet ween variance which is di- 
rectly associated with the ith variable, and that 
portion of the variance shared among the several 
independent variables which is to be associated 
with the ith variable. 

It therefore seems that some technique is de- 
sirable which would retainsimplicity and economy 
of effort, while yielding a maximum amount of in- 
formation. Such a system is found in the mathe- 
matical analysis of formulas [1]and [2] as shown 
below. 

In a set of normal equations for the least- 
square estimate of the betas for the multiple lin- 
ear-regression equation, the r’s are found as fol- 
lows: 


To: = Boi + TizBo2 + TisBos + Ti4 Bog 
Io2 = Boz + Ti2hoi + T2sBos + T24Bo4 
Tos = Bos + TisBoi + T2sBo2 + T34Bog 
T'o4 = Boa + Tigfor + T24Bo2 + T34 Bos 


Therefore, the right hand portion of | 3] can be 
substituted for ro, in formula[2], the right hand 
side of [4] can be substituted for ro, in[2], and 
soon. The result would be as follows: 


R* = Bo (Bo: + Ty2Po2 + Tisfos + Ti4 Boa) + 
Bo2(Bo2 + Ti2Bo: + T2sBos + T248o4) + [7] 
Bos(Bos + TisBoi + T2sBo2 + T3484) + 
Boa(Bos + Ti4Bo. + T24Bo2 + T34 Bos) 


Multiplyingto eliminate the parentheses one may 
find the following results: 


*The author wishes to acknowledge the advice received from R. F. Jarrett of the Department of Psychol - 


ogy, University of California, Berkeley, California. 





JOURNAL OF EXPERIMENTAL EDUCATION 


+ 


Bor + TizBo2Po: + TisBosho: + Tis Fos Pon 
Bae 4 Pi2zPoiPo2 ? FasPosho2 P'24ho4ho2 as 8] 
38s +TisPoifos + FasBo2os + ls4 Bo4hos + 


JAe + Tn Boi Bos + T24Bo2Bos + T34Bosho- 
It can now be seen that sum m ating like terms in 


(8] will produce formula[ 1] as it would be written 
in a four variable problem: 


° »2 ) , 
+ Pos * Po4T 2Boiho02Ti2 + 


R° =| 01 + Boe 
2Boi/ osTis * 28018048 14 2Bo2hosle2s3 t [ 9] 


Bo2Bo4t24 + 2BosBo4ls 


Therefore, from formula [3] and [7] it is apparent 
that the combined direct and shared variance as- 
sociated with variable one, as given by the fojroij 
product in formula [2], is actually equal to the beta 
squared, plus one-half of the summation of all the 
covariance terms in formula[{9], of the form 
28o:8ojf:j, Which include the beta of variable one. 
Likewise, it can be generalized that the total di- 
rect and shared variance in the criterion associ- 
ated with the ith independent var iable is given by 
the square of the beta for the ith variable, plus 
half of all the covariance terms in formula [1] 
which include the beta for the ith variable. 

It appears, then, from a mathematical view- 
point that the variance which the ith variable shares 
with other variabies will not necessarily be dis- 
tributed in the same proportion as is the variance 
directly associated with that variable, as Engel- 





hart proposed. Noting the relationship between 
formula [ 1]andformula[2], one may now compute 
both the variance whichis directly associated with 
the ith variable, andthat portionof variance shared 
among the independent variables which is to be as- 
sociated with the ith variable, by using the sim- 
pler procedure as given by formula [2], with the 
addition of a further, simple computation. First, 
the Soiroj product is found, yielding the total di- 
rect and shared variance associated with the ith 
variable. Then So; is squared, producing the var- 
iance directly associated with the ith variable. 
This figure is now subtracted from the Sojroj pro- 
duct, the difference beingthat portion of the vari- 
ance, shared by the independent variables, which 
is associated with the ith variable. 

It is believed that this method provides two ad- 
vantages over the procedures represented by either 
formula [1] or [2] alone: first, it yields maximum 
information concerning a given variable’s partici- 
pation in the variance accounted for in the criter- 
ion; and second, it materially reduces the number 
of computations necessary to secure that amount 
of information. 
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ON A TEST OF WHETHER TWO SETS OF OB- 
SERVATIONS BELONG TO A CERTAIN CLASS 
OF DISTRIBUTIONS WHEN THE RANDOM 
VARIABLES ARE NOT INDEPENDENT 


H. P. KUANG* 
University of Minnesota 


THERE EXIST several techniques which have 
been developed in recent years to test the hypoth- 
esis that two samples have been drawn from the 
same population. These developments were based 
on signs, ranks, or the randomization principle. 
For instance, to test the hypothesis that the two 
samples are from the same population, Mood con- 
sidered a test based on signs while Dixon pre- 
sented a statistic based on ranks. Utilizing the 
randomization principle, Pitman proposed the 
difference of sample means asa criterion, large 
values being significant. Wald and Wolfowitz de- 
vised the total number of runs in a sequence for 
testing the same hypothesis thattwo samples (not 
necessarily of equal size) have been drawn from 
acommon population. The tests developed are 
essentially limited to the continuous and independ- 
ent variables. We shall, however, treat the prob- 
lem without these restrictions. It is the purpose 
of this paper to present a probabilistic inequality 
which may be used todetermine whether two vari- 
ables are contained in a certain set of distribu- 
tions without the assumptions of continuity and in- 
dependence. 

Let us consider the two random variables X 
and Y which are not independent. Suppose that 
each of the variables is divided into a finite num- 
ber of parts A, Ar, and let the correspond- 
ing discrete marginal distribtuions of X and Y be 
F, = { pj] and F, = (ai } (i = 1 -- r) respectively, 
so that pj = P(X = Aj), qj = P(Y= Aj) and 


> pi =1, gel 
ie = is 


Let, further, the corresponding observations X 
and Y be {nj} and {mj} (i= 1 ---r) respective- 
ly, so that the sample values nj and mj belong 
to the set Aj and 








mj = m. 


r 
> 


i=1 
Any set of Aj carries the mass nj in the distribution 
n 
of X and the mass mj inthedistributionof Y. The 
m 
object here is to decide on the basis of two sets of 
observations {nj} and { mj} (i = 1 --- r) whether 
the two distributions F, and F, belong to the same 
class of distributions. 

Using the Minkowski inequality, the author has 
proved in his unpublished dissertation (University 
of Minnesota) that if the two distributions F, and 
F, belong toacertain set of distributions, the fol- 
lowing probabilistic inequality must hold: 

24 r-1 


2 
€ ~ r 
P{O(F., Fa) >1-€ } dion ian 


oe = 
vn Vm 

r o/h Mi ; 
where O(F, F2) = 2 A m and€ is any posi- 

l= 
tive numerical value. It should be noted that the 
derivation of the inequality was based on the con- 
ditions thatn > r, m> rr, and that when nj <1 or 
when mj<1, i#1. By use of the above result we 
can then evaluate an upper bound of the probability 
in terms of a pre-assigned risk, a. It is easily 
seen that if F, and F, have the same distribution, 
we get 


2 2 
i. _£ r° +r-l l P 1 
OFF) >1 ay a (F- 7) 


As a numerical illustration, consider the two 


*The author is currently Professor of Mathematics at The North Carolina Agricultural and Technical 


College in Greensboro, North Carolina. 
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distributions of grades which were received by 
students enrolled in courses in calculus and dif- 
ferential equations at a university. The number 
of students assigned to each of five grades A, B, 
C, D, and F for these twocourses were as follows: 


A B Cc D F Total 


Calculus 12 52 57 48 200 
Differential 
Equations 2 20 44 50 44 160 





The calculus was aprerequisitefor the course 
in differential equations. Thesetwo courses were 
composed of not only the same individual students 
but also some twins, brothers and sisters. Very 
few of these students were, however, transfers. 
It frequently happened in mathematics courses 
that students with a good record in calculus usu- 
ally made a good grade in differential equations. 
On the other hand, students with a poor grade in 
calculus were ordinarily unable to do satis fac- 
tory work in differential equations. For these 
reasons, the two distributions of grades are not 
independent. Accordingly, it is difficult to see 
how the ordinary test based on the assumption of 
independence could possibly be used. We now 
wish to know whether the two distributions of 
grades have come from the same population with- 
out the customary restriction of independence. 
The table at the bottom of this page gives the nu- 
merical calculations of O(F, F.). 

From the last row of the table, we obtain 





4 nj mj; 
O(F, F2) * iva am = +989 


If we use a risk, a, of .05, say, we get 


2 
[283-1 ( 1 —_ = ,5395. 
V 05 ¥200 V160 
2 


€ 
Thus O(F,F.) >1 - 7 = +730 


Therefore, the two distributions of grades in 
question belong to the same set of distributions. 

Finally, it may be added that if the random 
variables X and Y are assumed to be independent, 
we then have 


2 2 
p{(F,, F2) >1 . £} >1- rtd) a in 





1 1 256(r? + r - 1)? 
n2* m2) * €® n2 m2 


when F, =F, . 

It canbe readily shown that the previous result 
(without the assumption of independence) is a bet- 
ter bound of probability when the following inequal- 
ity holds: 


15(n? + m2) - 4Vvnm (Vn + Vm)? > Peed 








Calculus 
n 


Differential Equation = 


/nj mj 


n m 
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The text not only relates how the 
various statistical techniques work, 
but also why the techniques are used 
and how their properties are derived. 
When interpreting statistical methods, 
the book continually stresses ap- 
proaching them by intelligent, scien- 
tific analyses. After stating a statistical 
method, the text always points out 
why such methods are advantageous. 
The basic theory of each method is 
first stated and then the various ways 
in which it may be used are given. 
Realistic, practical problems are given 
as illustrations of the theory in use. 
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